Links for 2025-03-05
AI
1. Why do some LMs self-improve their reasoning while others hit a wall? Four key cognitive behaviors enable successful learning: Verification (checking work), Backtracking (trying new approaches), Subgoal Setting, and Backward Chaining (working backwards from a goal). https://arxiv.org/abs/2503.01307
2. A Three-Layer Model of LLM Psychology https://www.lesswrong.com/posts/zuXo9imNKYspu9HGv/a-three-layer-model-of-llm-psychology
3. Chain of Draft: Thinking Faster by Writing Less. CoD uses 80% fewer tokens per response yet maintains accuracy on math, commonsense, and other benchmarks. On GSM8K math problems, CoD achieved 91% accuracy with an 80% token reduction compared to CoT. https://arxiv.org/abs/2502.18600
4. Reasoning models will enable superhuman capabilities in “pure reasoning tasks” such as mathematics and abstract problem-solving https://epoch.ai/gradient-updates/the-promise-of-reasoning-models
5. SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers — “Our findings highlight the potential of LLMs to push the boundaries of mathematical reasoning and tackle NP-hard problems.” https://arxiv.org/abs/2502.20545
6. LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction https://arxiv.org/abs/2502.17925
7. The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models https://arxiv.org/abs/2503.02875
8. How Much Are LLMs Actually Boosting Real-World Programmer Productivity? https://www.lesswrong.com/posts/tqmQTezvXGFmfSe7f/how-much-are-llms-actually-boosting-real-world-programmer
9. New results on AI and lawyer productivity https://marginalrevolution.com/marginalrevolution/2025/03/new-results-on-ai-and-lawyer-productivity.html
10. German nuclear fusion startup Proxima Fusion works on a smart AI-assisted stellarator concept https://www.proximafusion.com/press-news/proxima-fusion-and-partners-publish-stellaris-fusion-power-plant-concept-to-bring-limitless-safe-clean-energy-to-the-grid
11. Alexa+: the next generation of Alexa—it uses Amazon's own Nova models as well as Claude, and will dynamically switch to the best model for each task. https://www.aboutamazon.com/news/devices/new-alexa-generative-artificial-intelligence
12. Opera's new AI-powered Operator browser can surf the web for you https://blogs.opera.com/news/2025/03/opera-browser-operator-ai-agentics/
AI politics
1. “The Government Knows A.G.I. is Coming” https://www.nytimes.com/2025/03/04/opinion/ezra-klein-podcast-ben-buchanan.html [no paywall: https://archive.is/cj6G1]
2. Scale AI announces multimillion-dollar defense deal, a major step in U.S. military automation https://www.cnbc.com/2025/03/05/scale-ai-announces-multimillion-dollar-defense-military-deal.html
3. Alibaba's CEO: They’re going all-in on AGI development as their primary focus. https://www.bloomberg.com/news/articles/2025-02-20/alibaba-ceo-wu-says-agi-is-now-company-s-primary-objective [no paywall: https://archive.is/0S4H9]
Brains
1. New minimally-invasive neural interface can be placed almost anywhere in the brain through a single spinal tap. https://www.nature.com/articles/s41551-024-01281-9
2. Can we compare subjective experiences (qualia) between individuals? https://www.cell.com/iscience/fulltext/S2589-0042(25)00289-5
Biotech and Security
1. Roche next generation sequencing https://www.youtube.com/watch?v=G8ECt04qPos
2. Delivering therapeutics to the brain through intranasal application of engineered commensal bacteria https://www.cell.com/cell/fulltext/S0092-8674(25)00046-7
3. Methods for strong human germline engineering https://www.lesswrong.com/posts/2w6hjptanQ3cDyDw7/methods-for-strong-human-germline-engineering
Technology
1. Amazon announces Ocelot quantum chip https://www.amazon.science/blog/amazon-announces-ocelot-quantum-chip
2. As of today, you can fit an ENTIRE COMPUTER into a single piece of thread. Analog sensing, LEDs, Bluetooth comms, processing, digital memory: it's all there https://www.nature.com/articles/s41586-024-08568-6