- Compression for AGI 2023.02
- Language Modeling Is Compression 2023.09
- Training Compute-Optimal Large Language Models 2022.03
- The Platonic Representation Hypothesis 2024.05
- Learning to Reason with LLMs 2024.09.12
- Parables on the Power of Planning in AI: From Poker to Diplomacy 2024.09.18
- Don't teach. Incentivize. 2024.09.20
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning 2025.01
- Kimi k1.5: Scaling Reinforcement Learning with LLMs 2025.01
- Attention Is All You Need 2017.06
- Improving Language Understanding by Generative Pre-Training 2018.06
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 2018.10
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models 2024.01
- Scaling Instruction-Finetuned Language Models 2022.10
- Reflexion: Language Agents with Verbal Reinforcement Learning 2023.03
- WizardLM: Empowering Large Language Models to Follow Complex Instructions 2023.04
- LIMA: Less Is More for Alignment 2023.05
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model 2023.05
- Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models 2023.05
- Preference Ranking Optimization for Human Alignment 2023.06
- Orca: Progressive Learning from Complex Explanation Traces of GPT-4 2023.06
- Self-Alignment with Instruction Backtranslation 2023.08
- Self-Rewarding Language Models 2024.01
- From Instructions to Constraints: Language Model Alignment with Automatic Constraint Verification 2024.03
- From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Language Models 2024.04
- Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models 2024.04
- The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions 2024.04
- Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models 2024.06
- Inverse Constitutional AI: Compressing Preferences into Principles 2024.06
- Following Length Constraints in Instructions 2024.06
- LIMO: Less is More for Reasoning 2025.02
- Self-critiquing models for assisting human evaluators 2022.06
- Weak-to-strong generalization 2023.12
- Prover-Verifier Games improve legibility of LLM outputs 2024.07
- Larger language models do in-context learning differently 2023.03
- Many-Shot In-Context Learning 2024.04
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models 2022.01
- Let’s Verify Step by Step 2023.05
- Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks 2023.05
- Tree of Thoughts: Deliberate Problem Solving with Large Language Models 2023.05
- Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations 2023.12
- Solving olympiad geometry without human demonstrations 2024.01
- Large Language Models Can Learn Temporal Reasoning 2024.01
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models 2024.02
- Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking 2024.03
- Llama 2: Open Foundation and Fine-Tuned Chat Models 2023.07
- Gemini 1.0 2023.12
- Gemini 1.5 2024.02
- The Llama 3 Herd of Models 2024.07
- DeepSeek-V3 Technical Report 2024.12
- Distinguishing three alignment taxes 2022.12
- State of GPT 2023.05
- An Observation on Generalization 2023.08
- An Initial Exploration of Theoretical Support for Language Model Data Engineering. Part 1: Pretraining 2023.09
- Some intuitions about large language models 2023.11
- MiniCPM: Unveiling the Unlimited Potential of On-Device Large Language Models 2024.04
- Llama 3 Opens the Second Chapter of the Game of Scale 2024.04
- Successful language model evals 2024.05
- OpenAI Model Spec 2024.05
- Claude’s Character 2024.06
- AI achieves silver-medal standard solving International Mathematical Olympiad problems 2024.07
- Three hypotheses on LLM reasoning 2024.12
- Scaling Paradigms for Large Language Models 2025.01
- Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process 2024.07
- Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems 2024.08
- Physics of Language Models: Part 3.1, Knowledge Storage and Extraction 2023.09
- Physics of Language Models: Part 3.2, Knowledge Manipulation 2023.09
- Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws 2024.04
- Killed by LLM
- Challenging BIG-Bench tasks and whether chain-of-thought can solve them 2022.10
- COLLIE: Systematic Construction of Constrained Text Generation Tasks 2023.07
- FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation 2023.10
- FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models 2023.10
- Instruction-Following Evaluation for Large Language Models 2023.11
- GAIA: a benchmark for General AI Assistants 2023.11
- Beyond Instruction Following: Evaluating Rule Following of Large Language Models 2024.07
- Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning 2024.06
- Introducing SimpleQA 2024.10
- Measuring short-form factuality in large language models 2024.11
- Humanity's Last Exam 2025.01