Universal Computation and Optimal Construction in the Chemical Reaction...
Nicholas Schiefer, Erik Winfree: Universal Computation and Optimal Construction in the Chemical Reaction Network-Controlled Tile Assembly Model. DNA 2015: 34-54
Time Complexity of Computation and Construction in the Chemical Reaction...
Nicholas Schiefer, Erik Winfree: Time Complexity of Computation and Construction in the Chemical Reaction Network-Controlled Tile Assembly Model. DNA 2016: 165-182
A Fill Estimation Algorithm for Sparse Matrices and Tensors in Blocked Formats.
Willow Ahrens, Helen Xu, Nicholas Schiefer: A Fill Estimation Algorithm for Sparse Matrices and Tensors in Blocked Formats. IPDPS 2018: 546-556
FoundationDB Record Layer: A Multi-Tenant Structured Datastore.
Christos Chrysafis, Ben Collins, Scott Dugas, Jay Dunkelberger, Moussa Ehsan, Scott Gray, Alec Grieser, Ori Herrnstadt, Kfir Lev-Ari, Tao Lin, Mike McMahon, Nicholas Schiefer, Alexander Shraer:...
Discovering Language Model Behaviors with Model-Written Evaluations.
Ethan Perez, Sam Ringer, Kamile Lukosiute, Karina Nguyen, Edwin Chen, Scott Heiner, Craig Pettit, Catherine Olsson, Sandipan Kundu, Saurav Kadavath, Andy Jones, Anna Chen, Ben Mann, Brian Israel, Bryan...
Constitutional AI: Harmlessness from AI Feedback.
Yuntao Bai, Saurav Kadavath, Sandipan Kundu, Amanda Askell, Jackson Kernion, Andy Jones, Anna Chen, Anna Goldie, Azalia Mirhoseini, Cameron McKinnon, Carol Chen, Catherine Olsson, Christopher Olah,...
Engineering Monosemanticity in Toy Models.
Adam S. Jermyn, Nicholas Schiefer, Evan Hubinger: Engineering Monosemanticity in Toy Models. CoRR abs/2211.09169 (2022)
Measuring Progress on Scalable Oversight for Large Language Models.
Samuel R. Bowman, Jeeyoon Hyun, Ethan Perez, Edwin Chen, Craig Pettit, Scott Heiner, Kamile Lukosiute, Amanda Askell, Andy Jones, Anna Chen, Anna Goldie, Azalia Mirhoseini, Cameron McKinnon,...
Exponentially Improving the Complexity of Simulating the Weisfeiler-Lehman...
Anders Aamand, Justin Y. Chen, Piotr Indyk, Shyam Narayanan, Ronitt Rubinfeld, Nicholas Schiefer, Sandeep Silwal, Tal Wagner: Exponentially Improving the Complexity of Simulating the Weisfeiler-Lehman...
Toy Models of Superposition.
Nelson Elhage, Tristan Hume, Catherine Olsson, Nicholas Schiefer, Tom Henighan, Shauna Kravec, Zac Hatfield-Dodds, Robert Lasenby, Dawn Drain, Carol Chen, Roger Grosse, Sam McCandlish, Jared Kaplan,...
Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and...
Deep Ganguli, Liane Lovitt, Jackson Kernion, Amanda Askell, Yuntao Bai, Saurav Kadavath, Ben Mann, Ethan Perez, Nicholas Schiefer, Kamal Ndousse, Andy Jones, Sam Bowman, Anna Chen, Tom Conerly, Nova...
Language Models (Mostly) Know What They Know.
Saurav Kadavath, Tom Conerly, Amanda Askell, Tom Henighan, Dawn Drain, Ethan Perez, Nicholas Schiefer, Zac Hatfield-Dodds, Nova DasSarma, Eli Tran-Johnson, Scott Johnston, Sheer El Showk, Andy Jones,...
Merge what you can, fork what you can't: managing data integrity in...
Nicholas Schiefer, Geoffrey Litt, Daniel Jackson: Merge what you can, fork what you can't: managing data integrity in local-first software. PaPoC@EuroSys 2022: 24-32
Specific versus General Principles for Constitutional AI.
Sandipan Kundu, Yuntao Bai, Saurav Kadavath, Amanda Askell, Andrew Callahan, Anna Chen, Anna Goldie, Avital Balwit, Azalia Mirhoseini, Brayden McLean, Catherine Olsson, Cassie Evraets, Eli...
Towards Understanding Sycophancy in Language Models.
Mrinank Sharma, Meg Tong, Tomasz Korbak, David Duvenaud, Amanda Askell, Samuel R. Bowman, Newton Cheng, Esin Durmus, Zac Hatfield-Dodds, Scott R. Johnston, Shauna Kravec, Timothy Maxwell, Sam...
Measuring Faithfulness in Chain-of-Thought Reasoning.
Tamera Lanham, Anna Chen, Ansh Radhakrishnan, Benoit Steiner, Carson Denison, Danny Hernandez, Dustin Li, Esin Durmus, Evan Hubinger, Jackson Kernion, Kamile Lukosiute, Karina Nguyen, Newton Cheng,...
Question Decomposition Improves the Faithfulness of Model-Generated Reasoning.
Ansh Radhakrishnan, Karina Nguyen, Anna Chen, Carol Chen, Carson Denison, Danny Hernandez, Esin Durmus, Evan Hubinger, Jackson Kernion, Kamile Lukosiute, Newton Cheng, Nicholas Joseph, Nicholas...
Towards Measuring the Representation of Subjective Global Opinions in...
Esin Durmus, Karina Nguyen, Thomas I. Liao, Nicholas Schiefer, Amanda Askell, Anton Bakhtin, Carol Chen, Zac Hatfield-Dodds, Danny Hernandez, Nicholas Joseph, Liane Lovitt, Sam McCandlish, Orowa...
Learned Interpolation for Better Streaming Quantile Approximation with...
Nicholas Schiefer, Justin Y. Chen, Piotr Indyk, Shyam Narayanan, Sandeep Silwal, Tal Wagner: Learned Interpolation for Better Streaming Quantile Approximation with Worst-Case Guarantees. CoRR...
The Capacity for Moral Self-Correction in Large Language Models.
Deep Ganguli, Amanda Askell, Nicholas Schiefer, Thomas I. Liao, Kamile Lukosiute, Anna Chen, Anna Goldie, Azalia Mirhoseini, Catherine Olsson, Danny Hernandez, Dawn Drain, Dustin Li, Eli Tran-Johnson,...
Riffle: Reactive Relational State for Local-First Applications.
Geoffrey Litt, Nicholas Schiefer, Johannes Schickling, Daniel Jackson: Riffle: Reactive Relational State for Local-First Applications. UIST 2023: 76:1-76:16
Discovering Language Model Behaviors with Model-Written Evaluations.
Ethan Perez, Sam Ringer, Kamile Lukosiute, Karina Nguyen, Edwin Chen, Scott Heiner, Craig Pettit, Catherine Olsson, Sandipan Kundu, Saurav Kadavath, Andy Jones, Anna Chen, Benjamin Mann, Brian Israel,...
Learned Interpolation for Better Streaming Quantile Approximation with...
Nicholas Schiefer, Justin Y. Chen, Piotr Indyk, Shyam Narayanan, Sandeep Silwal, Tal Wagner: Learned Interpolation for Better Streaming Quantile Approximation with Worst-Case Guarantees. ACDA 2023: 87-97
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training.
Evan Hubinger, Carson Denison, Jesse Mu, Mike Lambert, Meg Tong, Monte MacDiarmid, Tamera Lanham, Daniel M. Ziegler, Tim Maxwell, Newton Cheng, Adam S. Jermyn, Amanda Askell, Ansh Radhakrishnan, Cem...