본문으로 건너뛰기
SuanLab

Blog

274개 중 1-12번째 포스트

[논문 리뷰] LeanMarathon: Toward Reliable AI Co-Mathematicians through Long-Horizon Lean Autoformalization
2026-06-079Paper Review

[논문 리뷰] LeanMarathon: Toward Reliable AI Co-Mathematicians through Long-Horizon Lean Autoformalization

Long-horizon autoformalization of research mathematics fails not only at hard lemmas, but at scale: statements drift, dependencies tangle, context decays, and local repairs corrupt distant work. We pr...

Paper Review
cs.AI
cs.CL
+1
[논문 리뷰] Self-Revising Discovery Systems for Science: A Categorical Framework for Agentic Artificial Intelligence
2026-06-0718Paper Review

[논문 리뷰] Self-Revising Discovery Systems for Science: A Categorical Framework for Agentic Artificial Intelligence

Scientific discovery is not only answer generation but revision of the representational regime in which evidence, artifacts, operations, and verifiers are typed. We develop a category-theoretic accoun...

Paper Review
cs.AI
cond-mat.mtrl-sci
+1
[논문 리뷰] Memory Caching: RNNs with Growing Memory
2026-06-078Paper Review

[논문 리뷰] Memory Caching: RNNs with Growing Memory

Transformers have been established as the de-facto backbones for most recent advances in sequence modeling, mainly due to their growing memory capacity that scales with the context length. While plaus...

Paper Review
cs.LG
cs.AI
+1
[논문 리뷰] From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning
2026-06-078Paper Review

[논문 리뷰] From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning

Humans organize knowledge into compact conceptual categories that balance compression with semantic richness. Large Language Models (LLMs) exhibit impressive linguistic abilities, but whether they nav...

Paper Review
cs.CL
cs.AI
+1
[논문 리뷰] LT2: Linear-Time Looped Transformers
2026-06-038Paper Review

[논문 리뷰] LT2: Linear-Time Looped Transformers

Looped Transformers (LT) have emerged as a powerful architecture by iterating their layers multiple times before decoding the final token. However, pairing them with full attention retains quadratic c...

Paper Review
cs.LG
cs.LG
[논문 리뷰] Hallucinations Undermine Trust; Metacognition is a Way Forward
2026-06-036Paper Review

[논문 리뷰] Hallucinations Undermine Trust; Metacognition is a Way Forward

Despite significant strides in factual reliability, errors -- often termed hallucinations -- remain a major concern for generative AI, especially as LLMs are increasingly expected to be helpful in mor...

Paper Review
cs.CL
cs.CL
[논문 리뷰] AI Must Embrace Specialization via Superhuman Adaptable Intelligence
2026-06-039Paper Review

[논문 리뷰] AI Must Embrace Specialization via Superhuman Adaptable Intelligence

Everyone from AI executives and researchers to doomsayers, politicians, and activists is talking about Artificial General Intelligence (AGI). Yet, they often don't seem to agree on its exact definitio...

Paper Review
cs.AI
cs.AI
[논문 리뷰] AutoScientists: Self-Organizing Agent Teams for Long-Running Scientific Experimentation
2026-06-028Paper Review

[논문 리뷰] AutoScientists: Self-Organizing Agent Teams for Long-Running Scientific Experimentation

Scientific research proceeds through iterative cycles of hypothesis generation, experiment design, execution, and revision. AI agents can automate parts of this process, but existing approaches typica...

Paper Review
cs.AI
cs.AI
[논문 리뷰] AI for Auto-Research: Roadmap & User Guide
2026-06-0215Paper Review

[논문 리뷰] AI for Auto-Research: Roadmap & User Guide

AI-assisted research is crossing a threshold: fully automated systems can now generate research papers for as little as $15, while long-horizon agents can execute experiments, draft manuscripts, and s...

Paper Review
cs.AI
cs.AI
[논문 리뷰] Neural Weight Norm = Kolmogorov Complexity
2026-06-029Paper Review

[논문 리뷰] Neural Weight Norm = Kolmogorov Complexity

Why does weight decay work? We prove that, in any fixed-precision regime, the smallest weight norm of a looped neural network outputting a binary string equals the Kolmogorov complexity of that string...

Paper Review
cs.LG
cs.IT
+1
[논문 리뷰] Language Models Need Sleep
2026-05-267Paper Review

[논문 리뷰] Language Models Need Sleep

Transformer-based large language models are increasingly used for long-horizon tasks; however, their attention mechanism scales poorly with context length. To handle this, we study a sleep-like consol...

Paper Review
cs.CL
cs.AI
+1
[논문 리뷰] SkillOpt: Executive Strategy for Self-Evolving Agent Skills
2026-05-268Paper Review

[논문 리뷰] SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Agent skills today are hand-crafted, generated one-shot, or evolved through loosely controlled self-revision, none of which behaves like a deep-learning optimizer for the skill, and none of which reli...

Paper Review
cs.AI
cs.CL
+1
...