Blog
262개 중 1-12번째 포스트
![[논문 리뷰] Tokenisation via Convex Relaxations](/assets/images/blog/20260524-paper-2605-22821-tokenisation-via-convex-relaxa.jpg)
[논문 리뷰] Tokenisation via Convex Relaxations
Tokenisation is an integral part of the current NLP pipeline. Current tokenisation algorithms such as BPE and Unigram are greedy algorithms -- they make locally optimal decisions without considering t...
![[논문 리뷰] Advancing Mathematics Research with AI-Driven Formal Proof Search](/assets/images/blog/20260524-paper-2605-22763-advancing-mathematics-research.jpg)
[논문 리뷰] Advancing Mathematics Research with AI-Driven Formal Proof Search
Large language models (LLMs) increasingly excel at mathematical reasoning, but their unreliability limits their utility in mathematics research. A mitigation is using LLMs to generate formal proofs in...
![[논문 리뷰] Equilibrium Reasoners: Learning Attractors Enables Scalable Reasoning](/assets/images/blog/20260524-paper-2605-21488-equilibrium-reasoners-learning.jpg)
[논문 리뷰] Equilibrium Reasoners: Learning Attractors Enables Scalable Reasoning
Scaling test-time compute by iteratively updating a latent state has emerged as a powerful paradigm for reasoning. Yet the internal mechanisms that enable these iterative models to generalize beyond m...
![[논문 리뷰] Hallucinations Undermine Trust; Metacognition is a Way Forward](/assets/images/blog/20260524-paper-2605-01428-hallucinations-undermine-trust.jpg)
[논문 리뷰] Hallucinations Undermine Trust; Metacognition is a Way Forward
Despite significant strides in factual reliability, errors -- often termed hallucinations -- remain a major concern for generative AI, especially as LLMs are increasingly expected to be helpful in mor...
![[논문 리뷰] The Alien Space of Science: Sampling Coherent but Cognitively Unavailable Research Directions](/assets/images/blog/20260524-paper-2603-01092-the-alien-space-of-science-sam.jpg)
[논문 리뷰] The Alien Space of Science: Sampling Coherent but Cognitively Unavailable Research Directions
Scientific discovery is constrained not only by what is true, but by what is cognitively available to the researchers currently exploring a field. Many directions are coherent in light of the literatu...
![[논문 리뷰] Generative Recursive Reasoning](/assets/images/blog/20260522-paper-2605-19376-generative-recursive-reasoning.jpg)
[논문 리뷰] Generative Recursive Reasoning
How should future neural reasoning systems implement extended computation? Recursive Reasoning Models (RRMs) offer a promising alternative to autoregressive sequence extension by performing iterative ...
![[논문 리뷰] Language Game: Talking to Non-Human Systems](/assets/images/blog/20260522-paper-2605-16321-language-game-talking-to-non-h.jpg)
[논문 리뷰] Language Game: Talking to Non-Human Systems
Language carries thought and coordination among humans but rarely reaches further along the spectrum of diverse intelligence. Yet non-neural systems -- from gene regulatory networks and microbial cons...
![[논문 리뷰] TRINITY: An Evolved LLM Coordinator](/assets/images/blog/20260522-paper-2512-04695-trinity-an-evolved-llm-coordin.jpg)
[논문 리뷰] TRINITY: An Evolved LLM Coordinator
Combining diverse foundation models is promising, but weight-merging is limited by mismatched architectures and closed APIs. Trinity addresses this with a lightweight coordinator that orchestrates col...
![[논문 리뷰] Code as Agent Harness](/assets/images/blog/20260521-paper-2605-18747-code-as-agent-harness.jpg)
[논문 리뷰] Code as Agent Harness
Recent large language models (LLMs) have demonstrated strong capabilities in understanding and generating code, from competitive programming to repository-level software engineering. In emerging agent...
![[논문 리뷰] MIRAGE: The Illusion of Visual Understanding](/assets/images/blog/20260518-paper-2603-21687-mirage-the-illusion-of-visual-.jpg)
[논문 리뷰] MIRAGE: The Illusion of Visual Understanding
Multimodal AI systems have achieved remarkable performance across a broad range of real-world tasks, yet the mechanisms underlying visual-language reasoning remain surprisingly poorly understood. We r...
![[논문 리뷰] ETS: Energy-Guided Test-Time Scaling for Training-Free RL Alignment](/assets/images/blog/20260518-paper-2601-21484-ets-energy-guided-test-time-sc.jpg)
[논문 리뷰] ETS: Energy-Guided Test-Time Scaling for Training-Free RL Alignment
Reinforcement Learning (RL) post-training alignment for language models is effective, but also costly and unstable in practice, owing to its complicated training process. To address this, we propose a...
![[논문 리뷰] Adaptation of Agentic AI: A Survey of Post-Training, Memory, and Skills](/assets/images/blog/20260518-paper-2512-16301-adaptation-of-agentic-ai-a-sur.jpg)
[논문 리뷰] Adaptation of Agentic AI: A Survey of Post-Training, Memory, and Skills
Large language model (LLM) agents are moving beyond prompting alone. ChatGPT marked the rise of general-purpose LLM assistants, DeepSeek showed that on-policy reinforcement learning with verifiable re...
