AI-Paper-2024
Most Valuable AI Research Papers in 2024
Mixture of Experts (MoE) Approach
You can find the original paper[3] here.
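The list gives titles only, so as a quick illustration of the core MoE idea (sparse top-k routing over a pool of expert networks), here is a minimal NumPy sketch. All names, shapes, and the single-token setup are illustrative assumptions, not taken from the paper:

```python
import numpy as np

def moe_layer(x, gate_w, experts, k=2):
    """Sparse MoE: route one token to its top-k experts.

    x: (d,) token vector; gate_w: (d, n_experts) router weights;
    experts: list of callables mapping (d,) -> (d,).
    Shapes and names are illustrative, not from the paper.
    """
    logits = x @ gate_w                  # router score per expert
    top = np.argsort(logits)[-k:]        # indices of the top-k experts
    weights = np.exp(logits[top])
    weights /= weights.sum()             # softmax over the selected experts only
    # Output is the routing-weighted sum of the chosen experts' outputs
    return sum(w * experts[i](x) for w, i in zip(weights, top))

rng = np.random.default_rng(0)
d, n = 4, 8
experts = [lambda v, W=rng.standard_normal((d, d)): v @ W for _ in range(n)]
y = moe_layer(rng.standard_normal(d), rng.standard_normal((d, n)), experts)
print(y.shape)
```

Because only k of the n experts run per token, parameter count grows with n while per-token compute stays roughly constant, which is the appeal of the approach.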
DoRA: Weight-Decomposed Low-Rank Adaptation
You can find the original paper[4] here.
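To make the title concrete: DoRA reparameterizes a pretrained weight into a magnitude part and a direction part, and applies the LoRA-style low-rank update to the direction. A minimal NumPy sketch under assumed shapes (not the official implementation):

```python
import numpy as np

def dora_update(W0, B, A, m):
    """DoRA-style reparameterization (illustrative, not the official code).

    W0: frozen pretrained weight (out, in); B @ A: low-rank LoRA update;
    m: learnable per-column magnitude vector (in,).
    The direction is the column-normalized merged weight; m sets its scale.
    """
    V = W0 + B @ A                                   # merged weight, LoRA-style
    norm = np.linalg.norm(V, axis=0, keepdims=True)  # column-wise norms
    return m * (V / norm)                            # magnitude * unit direction

rng = np.random.default_rng(1)
out_d, in_d, r = 6, 4, 2
W0 = rng.standard_normal((out_d, in_d))
B = rng.standard_normal((out_d, r))
A = np.zeros((r, in_d))                 # LoRA-style init: B @ A == 0 at start
m = np.linalg.norm(W0, axis=0)          # init magnitude from W0's column norms
W = dora_update(W0, B, A, m)
# With B @ A == 0 and this init, the reparameterized weight recovers W0 exactly
print(np.allclose(W, W0))
```

At initialization the model is unchanged (as in LoRA); training then updates only m, B, and A, so the number of trainable parameters stays small.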
Simple and Scalable Strategies to Continually Pre-train Large Language Models
You can find the original paper[5] here.
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
You can find the original paper[6] here.
LoRA Learns Less and Forgets Less
You can find the original paper[7] here.
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
You can find the original paper[8] here.
The Llama 3 Herd of Models
You can find the original paper[9] here.
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
You can find the original paper[10] here.
NVLM: Open Frontier-Class Multimodal LLMs
You can find the original paper[11] here.
O1 Replication Journey: A Strategic Progress Report – Part 1
You can find the original paper[12] here.
Scaling Laws for Precision
You can find the original paper[13] here.
DeepSeek-V3 Technical Report
You can find the original paper[14] here.
Phi-4 Technical Report
You can find the original paper[15] here.
References
Related posts and papers:
1. https://substack.com/home/post/p-153341037
2. https://substack.com/home/post/p-153692738
3. https://arxiv.org/abs/2401.04088
4. https://arxiv.org/abs/2402.09353
5. https://arxiv.org/abs/2403.08763
6. https://arxiv.org/abs/2404.10719
7. https://arxiv.org/abs/2405.09673
8. https://arxiv.org/abs/2406.17557
9. https://arxiv.org/abs/2407.21783
10. https://arxiv.org/abs/2408.03314
11. https://arxiv.org/abs/2409.11402
12. https://arxiv.org/abs/2410.18982
13. https://arxiv.org/abs/2411.04330
14. https://arxiv.org/abs/2412.19437
15. https://arxiv.org/abs/2412.08905
https://xiyuanyang-code.github.io/posts/AI-Paper-2024/