본문으로 건너뛰기
SuanLab

Blog

257개 중 1-12번째 포스트

[논문 리뷰] Generative Recursive Reasoning
2026-05-228Paper Review

[논문 리뷰] Generative Recursive Reasoning

How should future neural reasoning systems implement extended computation? Recursive Reasoning Models (RRMs) offer a promising alternative to autoregressive sequence extension by performing iterative ...

Paper Review
cs.AI
cs.AI
[논문 리뷰] Language Game: Talking to Non-Human Systems
2026-05-229Paper Review

[논문 리뷰] Language Game: Talking to Non-Human Systems

Language carries thought and coordination among humans but rarely reaches further along the spectrum of diverse intelligence. Yet non-neural systems -- from gene regulatory networks and microbial cons...

Paper Review
cs.LG
cs.LG
[논문 리뷰] TRINITY: An Evolved LLM Coordinator
2026-05-229Paper Review

[논문 리뷰] TRINITY: An Evolved LLM Coordinator

Combining diverse foundation models is promising, but weight-merging is limited by mismatched architectures and closed APIs. Trinity addresses this with a lightweight coordinator that orchestrates col...

Paper Review
cs.LG
cs.LG
[논문 리뷰] Code as Agent Harness
2026-05-2118Paper Review

[논문 리뷰] Code as Agent Harness

Recent large language models (LLMs) have demonstrated strong capabilities in understanding and generating code, from competitive programming to repository-level software engineering. In emerging agent...

Paper Review
cs.CL
cs.AI
+1
[논문 리뷰] MIRAGE: The Illusion of Visual Understanding
2026-05-187Paper Review

[논문 리뷰] MIRAGE: The Illusion of Visual Understanding

Multimodal AI systems have achieved remarkable performance across a broad range of real-world tasks, yet the mechanisms underlying visual-language reasoning remain surprisingly poorly understood. We r...

Paper Review
cs.AI
cs.AI
[논문 리뷰] ETS: Energy-Guided Test-Time Scaling for Training-Free RL Alignment
2026-05-1816Paper Review

[논문 리뷰] ETS: Energy-Guided Test-Time Scaling for Training-Free RL Alignment

Reinforcement Learning (RL) post-training alignment for language models is effective, but also costly and unstable in practice, owing to its complicated training process. To address this, we propose a...

Paper Review
cs.LG
cs.LG
[논문 리뷰] Adaptation of Agentic AI: A Survey of Post-Training, Memory, and Skills
2026-05-1819Paper Review

[논문 리뷰] Adaptation of Agentic AI: A Survey of Post-Training, Memory, and Skills

Large language model (LLM) agents are moving beyond prompting alone. ChatGPT marked the rise of general-purpose LLM assistants, DeepSeek showed that on-policy reinforcement learning with verifiable re...

Paper Review
cs.AI
cs.CL
+1
[논문 리뷰] GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
2026-05-188Paper Review

[논문 리뷰] GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models

Recent advancements in Large Language Models (LLMs) have sparked interest in their formal reasoning capabilities, particularly in mathematics. The GSM8K benchmark is widely used to assess the mathemat...

Paper Review
cs.LG
cs.AI
+1
[논문 리뷰] Compressed Convolutional Attention: Efficient Attention in a Compressed Latent Space
2026-05-168Paper Review

[논문 리뷰] Compressed Convolutional Attention: Efficient Attention in a Compressed Latent Space

Multi-headed Attention's (MHA) quadratic compute and linearly growing KV-cache make long-context transformers expensive to train and serve. Prior works such as Grouped Query Attention (GQA) and Multi-...

Paper Review
cs.CL
cs.AI
+1
[논문 리뷰] Memory-Efficient Looped Transformer: Decoupling Compute from Memory in Looped Language Models
2026-05-157Paper Review

[논문 리뷰] Memory-Efficient Looped Transformer: Decoupling Compute from Memory in Looped Language Models

Recurrent LLM architectures have emerged as a promising approach for improving reasoning, as they enable multi-step computation in the embedding space without generating intermediate tokens. Models su...

Paper Review
cs.CL
cs.AI
+1
[논문 리뷰] AI co-mathematician: Accelerating mathematicians with agentic AI
2026-05-157Paper Review

[논문 리뷰] AI co-mathematician: Accelerating mathematicians with agentic AI

We introduce the AI co-mathematician, a workbench for mathematicians to interactively leverage AI agents to pursue open-ended research. The AI co-mathematician is optimized to provide holistic support...

Paper Review
cs.AI
cs.AI
[논문 리뷰] Functional Post-Clustering Selective Inference with Applications to EHR Data Analysis
2026-05-1511Paper Review

[논문 리뷰] Functional Post-Clustering Selective Inference with Applications to EHR Data Analysis

In electronic health records (EHR) analysis, clustering patients according to patterns in their data is crucial for uncovering new subtypes of diseases. Existing medical literature often relies on cla...

Paper Review
stat.ME
stat.AP
+1
...