Professor's pick
작성자
admidea
작성일
2025-10-13
조회
51
1. The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
2. PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models
3. Continuous Thought Machines
4. Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers
5. Learning to Reason without External Rewards
6. WhyKnowledge Distillation Works in Generative Models: A Minimal Working Explanation
7. Absolute Zero: Reinforced Self-play Reasoning with Zero Data
8. Learning without training: The implicit dynamics of in-context learning
9. AI ResearchAgentsforMachineLearning: Search, Exploration, and Generalization in MLE-bench
10. Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory
11. Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
12. Transformers are SSMs: generalized models and efficient algorithms through structured state space duality. Proceedings of the 41st International Conference on Machine Learning.
13. Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation
2. PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models
3. Continuous Thought Machines
4. Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers
5. Learning to Reason without External Rewards
6. WhyKnowledge Distillation Works in Generative Models: A Minimal Working Explanation
7. Absolute Zero: Reinforced Self-play Reasoning with Zero Data
8. Learning without training: The implicit dynamics of in-context learning
9. AI ResearchAgentsforMachineLearning: Search, Exploration, and Generalization in MLE-bench
10. Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory
11. Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
12. Transformers are SSMs: generalized models and efficient algorithms through structured state space duality. Proceedings of the 41st International Conference on Machine Learning.
13. Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation
전체 0
- 다음 다음글이 없습니다.
- 이전 이전글이 없습니다.