Blog
Stories about data science, artificial intelligence, and deep learning
163 posts

A New Approach to Understanding the Complexity of Human Behavior
In modern society, the complexity of human behavior has become an important subject of data science and artificial intelligence (AI) research. This research is closely tied to our everyday lives, influencing areas ranging from social interaction to decision-making. Studies recently published in prestigious journals such as Nature offer new insights into this topic. This blog...
![[Paper Review] Reasoning to Learn from Latent Thoughts](/assets/images/blog/20260223-paper-2503-18866-reasoning-to-learn-from-latent.jpg)
[Paper Review] Reasoning to Learn from Latent Thoughts
Compute scaling for language model (LM) pretraining has outpaced the growth of human-written texts, leading to concerns that data will become the bottleneck to LM scaling. To continue scaling pretrain...
![[Paper Review] Reinforcement Learning via Self-Distillation](/assets/images/blog/20260222-paper-2601-20802-reinforcement-learning-via-sel.jpg)
[Paper Review] Reinforcement Learning via Self-Distillation
Large language models are increasingly post-trained with reinforcement learning in verifiable domains such as code and math. Yet, current methods for reinforcement learning with verifiable rewards (RL...
![[Paper Review] Unified Latents (UL): How to train your latents](/assets/images/blog/20260221-paper-2602-17270-unified-latents-ul-how-to-trai.jpg)
[Paper Review] Unified Latents (UL): How to train your latents
We present Unified Latents (UL), a framework for learning latent representations that are jointly regularized by a diffusion prior and decoded by a diffusion model. By linking the encoder's output noi...
![[Paper Review] One-step Language Modeling via Continuous Denoising](/assets/images/blog/20260221-paper-2602-16813-one-step-language-modeling-via.jpg)
[Paper Review] One-step Language Modeling via Continuous Denoising
Language models based on discrete diffusion have attracted widespread interest for their potential to provide faster generation than autoregressive models. In practice, however, they exhibit a sharp d...
![[Paper Review] Towards a Science of AI Agent Reliability](/assets/images/blog/20260221-paper-2602-16666-towards-a-science-of-ai-agent-.jpg)
[Paper Review] Towards a Science of AI Agent Reliability
AI agents are increasingly deployed to execute important tasks. While rising accuracy scores on standard benchmarks suggest rapid progress, many agents continue to fail in practice. This discrep...
![[Paper Review] Learning to Learn from Language Feedback with Social Meta-Learning](/assets/images/blog/20260221-paper-2602-16488-learning-to-learn-from-languag.jpg)
[Paper Review] Learning to Learn from Language Feedback with Social Meta-Learning
Large language models (LLMs) often struggle to learn from corrective feedback within a conversational context. They are rarely proactive in soliciting this feedback, even when faced with ambiguity, wh...
![[Paper Review] MemoryArena: Benchmarking Agent Memory in Interdependent Multi-Session Agentic Tasks](/assets/images/blog/20260221-paper-2602-16313-memoryarena-benchmarking-agent.jpg)
[Paper Review] MemoryArena: Benchmarking Agent Memory in Interdependent Multi-Session Agentic Tasks
Existing evaluations of agents with memory typically assess memorization and action in isolation. One class of benchmarks evaluates memorization by testing recall of past conversations or text but fai...
![[Paper Review] Long-Tail Knowledge in Large Language Models: Taxonomy, Mechanisms, Interventions and Implications](/assets/images/blog/20260221-paper-2602-16201-long-tail-knowledge-in-large-l.jpg)
[Paper Review] Long-Tail Knowledge in Large Language Models: Taxonomy, Mechanisms, Interventions and Implications
Large language models (LLMs) are trained on web-scale corpora that exhibit steep power-law distributions, in which the distribution of knowledge is highly long-tailed, with most appearing infrequently...
![[Paper Review] Conjugate Learning Theory: Uncovering the Mechanisms of Trainability and Generalization in Deep Neural Networks](/assets/images/blog/20260221-paper-2602-16177-conjugate-learning-theory-unco.jpg)
[Paper Review] Conjugate Learning Theory: Uncovering the Mechanisms of Trainability and Generalization in Deep Neural Networks
In this work, we propose a notion of practical learnability grounded in finite sample settings, and develop a conjugate learning theoretical framework based on convex conjugate duality to characterize...
![[Paper Review] Learning Personalized Agents from Human Feedback](/assets/images/blog/20260221-paper-2602-16173-learning-personalized-agents-f.jpg)
[Paper Review] Learning Personalized Agents from Human Feedback
Modern AI agents are powerful but often fail to align with the idiosyncratic, evolving preferences of individual users. Prior approaches typically rely on static datasets, either training implicit pre...
![[Paper Review] Scaling Beyond Masked Diffusion Language Models](/assets/images/blog/20260221-paper-2602-15014-scaling-beyond-masked-diffusio.jpg)
[Paper Review] Scaling Beyond Masked Diffusion Language Models
Diffusion language models are a promising alternative to autoregressive models due to their potential for faster generation. Among discrete diffusion approaches, masked diffusion currently dominates, ...
![[Paper Review] When Models Manipulate Manifolds: The Geometry of a Counting Task](/assets/images/blog/20260221-paper-2601-04480-when-models-manipulate-manifol.jpg)
[Paper Review] When Models Manipulate Manifolds: The Geometry of a Counting Task
Language models can perceive visual properties of text despite receiving only sequences of tokens. We mechanistically investigate how Claude 3.5 Haiku accomplishes one such task: linebreaking in fixed-...
![[Paper Review] Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition](/assets/images/blog/20260221-paper-2410-01380-knowledge-entropy-decay-during.jpg)
[Paper Review] Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition
In this work, we investigate how a model's tendency to broadly integrate its parametric knowledge evolves throughout pretraining, and how this behavior affects overall performance, particularly in ter...
![[Paper Review] GLM-5: from Vibe Coding to Agentic Engineering](/assets/images/blog/20260220-paper-2602-15763-glm-5-from-vibe-coding-to-agen.jpg)
[Paper Review] GLM-5: from Vibe Coding to Agentic Engineering
We present GLM-5, a next-generation foundation model designed to transition the paradigm of vibe coding to agentic engineering. Building upon the agentic, reasoning, and coding (ARC) capabilities of i...
![[Paper Review] PaperBanana: Automating Academic Illustration for AI Scientists](/assets/images/blog/20260219-paper-2601-23265-paperbanana-automating-academi.jpg)
[Paper Review] PaperBanana: Automating Academic Illustration for AI Scientists
Despite rapid advances in autonomous AI scientists powered by language models, generating publication-ready illustrations remains a labor-intensive bottleneck in the research workflow. To lift this bu...
![[Paper Review] On-Policy Context Distillation for Language Models](/assets/images/blog/20260218-paper-2602-12275-on-policy-context-distillation.jpg)
[Paper Review] On-Policy Context Distillation for Language Models
Context distillation enables language models to internalize in-context knowledge into their parameters. In our work, we propose On-Policy Context Distillation (OPCD), a framework that bridges on-polic...
![[Paper Review] Evolutionary Router Feature Generation for Zero-Shot Graph Anomaly Detection with Mixture-of-Experts](/assets/images/blog/20260218-paper-2602-11622-evolutionary-router-feature-ge.jpg)
[Paper Review] Evolutionary Router Feature Generation for Zero-Shot Graph Anomaly Detection with Mixture-of-Experts
Zero-shot graph anomaly detection (GAD) has attracted increasing attention in recent years, yet the heterogeneity of graph structures, features, and anomaly patterns across graphs makes existing single GN...
![[Paper Review] The Implicit Bias of Steepest Descent with Mini-batch Stochastic Gradient](/assets/images/blog/20260218-paper-2602-11557-the-implicit-bias-of-steepest-.jpg)
[Paper Review] The Implicit Bias of Steepest Descent with Mini-batch Stochastic Gradient
A variety of widely used optimization methods like SignSGD and Muon can be interpreted as instances of steepest descent under different norm-induced geometries. In this work, we study the implicit bia...
![[Paper Review] RiemannGL: Riemannian Geometry Changes Graph Deep Learning](/assets/images/blog/20260218-paper-2602-10982-riemanngl-riemannian-geometry-.jpg)
[Paper Review] RiemannGL: Riemannian Geometry Changes Graph Deep Learning
Graphs are ubiquitous, and learning on graphs has become a cornerstone in artificial intelligence and data mining communities. Unlike pixel grids in images or sequential structures in language, graphs...
![[Paper Review] Towards Autonomous Mathematics Research](/assets/images/blog/20260218-paper-2602-10177-towards-autonomous-mathematics.jpg)
[Paper Review] Towards Autonomous Mathematics Research
Recent advances in foundational models have yielded reasoning systems capable of achieving a gold-medal standard at the International Mathematical Olympiad. The transition from competition-level probl...
![[Paper Review] Theory of Space: Can Foundation Models Construct Spatial Beliefs through Active Exploration?](/assets/images/blog/20260218-paper-2602-07055-theory-of-space-can-foundation.jpg)
[Paper Review] Theory of Space: Can Foundation Models Construct Spatial Beliefs through Active Exploration?
Spatial embodied intelligence requires agents to act to acquire information under partial observability. While multimodal foundation models excel at passive perception, their capacity for active, self...
![[Paper Review] HyperMLP: An Integrated Perspective for Sequence Modeling](/assets/images/blog/20260217-paper-2602-12601-hypermlp-an-integrated-perspec.jpg)
[Paper Review] HyperMLP: An Integrated Perspective for Sequence Modeling
Self-attention is often viewed as probabilistic query-key lookup, motivating designs that preserve normalized attention scores and fixed positional semantics. We advocate a simpler and more unified pe...
![[Paper Review] Think like a Scientist: Physics-guided LLM Agent for Equation Discovery](/assets/images/blog/20260217-paper-2602-12259-think-like-a-scientist-physics.jpg)
[Paper Review] Think like a Scientist: Physics-guided LLM Agent for Equation Discovery
Explaining observed phenomena through symbolic, interpretable formulas is a fundamental goal of science. Recently, large language models (LLMs) have emerged as promising tools for symbolic equation di...
![[Paper Review] Latent Forcing: Reordering the Diffusion Trajectory for Pixel-Space Image Generation](/assets/images/blog/20260217-paper-2602-11401-latent-forcing-reordering-the-.jpg)
[Paper Review] Latent Forcing: Reordering the Diffusion Trajectory for Pixel-Space Image Generation
Latent diffusion models excel at generating high-quality images but lose the benefits of end-to-end modeling. They discard information during image encoding, require a separately trained decoder, and ...
![[Paper Review] Causal-JEPA: Learning World Models through Object-Level Latent Interventions](/assets/images/blog/20260217-paper-2602-11389-causal-jepa-learning-world-mod.jpg)
[Paper Review] Causal-JEPA: Learning World Models through Object-Level Latent Interventions
World models require robust relational understanding to support prediction, reasoning, and control. While object-centric representations provide a useful abstraction, they are not sufficient to captur...
![[Paper Review] TabICLv2: A better, faster, scalable, and open tabular foundation model](/assets/images/blog/20260217-paper-2602-11139-tabiclv2-a-better-faster-scala.jpg)
[Paper Review] TabICLv2: A better, faster, scalable, and open tabular foundation model
Tabular foundation models, such as TabPFNv2 and TabICL, have recently dethroned gradient-boosted trees at the top of predictive benchmarks, demonstrating the value of in-context learning for tabular d...
![[Paper Review] VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model](/assets/images/blog/20260217-paper-2602-10098-vla-jepa-enhancing-vision-lang.jpg)
[Paper Review] VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model
Pretraining Vision-Language-Action (VLA) policies on internet-scale video is appealing, yet current latent-action objectives often learn the wrong thing: they remain anchored to pixel variation rather...

The Future of AI and Data Science: Frontier Technologies and Innovation
Artificial intelligence (AI) and data science are the two great pillars driving modern technological innovation. Beyond our daily lives, they are bringing fundamental change across industries such as healthcare, finance, and manufacturing. AI mimics human intelligence to drive autonomous cars, diagnose diseases, and recommend personalized content. Data science ensures that these AI models perform at their best...
![[Paper Review] Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning](/assets/images/blog/20260213-paper-2602-10090-agent-world-model-infinity-syn.jpg)
[Paper Review] Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
Recent advances in large language models (LLMs) have empowered autonomous agents to perform complex tasks that require multi-turn interactions with tools and environments. However, scaling such agent tr...
![[Paper Review] Deriving Neural Scaling Laws from the statistics of natural language](/assets/images/blog/20260213-paper-2602-07488-deriving-neural-scaling-laws-f.jpg)
[Paper Review] Deriving Neural Scaling Laws from the statistics of natural language
Despite the fact that experimental neural scaling laws have substantially guided empirical progress in large-scale machine learning, no existing theory can quantitatively predict the exponents of thes...
![[Paper Review] Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks](/assets/images/blog/20260213-paper-2602-01630-research-on-world-models-is-no.jpg)
[Paper Review] Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks
World models have emerged as a critical frontier in AI research, aiming to enhance large models by infusing them with physical dynamics and world knowledge. The core objective is to enable agents to u...
![[Paper Review] Recursive Language Models](/assets/images/blog/20260213-paper-2512-24601-recursive-language-models.jpg)
[Paper Review] Recursive Language Models
We study allowing large language models (LLMs) to process arbitrarily long prompts through the lens of inference-time scaling. We propose Recursive Language Models (RLMs), a general inference paradigm...

Advances in Self-Supervised Learning: A Journey Toward Data-Efficient Learning
Most modern artificial intelligence (AI) technology relies on large amounts of labeled data to train models. In the real world, however, most data is unlabeled, and labeling it manually costs enormous amounts of money and time. Self-Supervised Learning (SSL), a powerful approach to solving this problem, has recently...
![[Paper Review] Reinforced Attention Learning](/assets/images/blog/20260211-paper-2602-04884-reinforced-attention-learning.jpg)
[Paper Review] Reinforced Attention Learning
Post-training with Reinforcement Learning (RL) has substantially improved reasoning in Large Language Models (LLMs) via test-time scaling. However, extending this paradigm to Multimodal LLMs (MLLMs) t...

A New Paradigm in Human Behavior Modeling: Deep Learning Approaches to Social Interaction
Human behavior modeling is one of the most fascinating and challenging research topics in artificial intelligence (AI) and data science. In particular, the ability to understand and predict human social interaction holds great potential to drive innovation across applications such as conversational AI, robotics, and mental health care. A study recently published in Nature (https://www.nature.com/artic...
![[Paper Review] PluRel: Synthetic Data unlocks Scaling Laws for Relational Foundation Models](/assets/images/blog/20260210-paper-2602-04029-plurel-synthetic-data-unlocks-.jpg)
[Paper Review] PluRel: Synthetic Data unlocks Scaling Laws for Relational Foundation Models
Relational Foundation Models (RFMs) facilitate data-driven decision-making by learning from complex multi-table databases. However, the diverse relational databases needed to train such models are rar...
![[Paper Review] Learning to Reason in 13 Parameters](/assets/images/blog/20260209-paper-2602-04118-learning-to-reason-in-13-param.jpg)
[Paper Review] Learning to Reason in 13 Parameters
Recent research has shown that language models can learn to *reason*, often via reinforcement learning. Some work even trains low-rank parameterizations for reasoning, but conventional LoRA can...
![[Paper Review] Titans: Learning to Memorize at Test Time](/assets/images/blog/20260205-paper-2501-00663-titans-learning-to-memorize-at.jpg)
[Paper Review] Titans: Learning to Memorize at Test Time
Over more than a decade there has been an extensive research effort on how to effectively utilize recurrent models and attention. While recurrent models aim to compress the data into a fixed-size memo...
![[Paper Review] Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning](/assets/images/blog/20260203-paper-2601-15160-knowledge-graphs-are-implicit-.jpg)
[Paper Review] Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning
Large language models have achieved near-expert performance in structured reasoning domains like mathematics and programming, yet their ability to perform compositional multi-hop reasoning in speciali...
![[Paper Review] Dynamic Sparse Learning: A Novel Paradigm for Efficient Recommendation](/assets/images/blog/20260202-paper-2402-02855-dynamic-sparse-learning-a-nove.jpg)
[Paper Review] Dynamic Sparse Learning: A Novel Paradigm for Efficient Recommendation
In the realm of deep learning-based recommendation systems, the increasing computational demands, driven by the growing number of users and items, pose a significant challenge to practical deployment....
![[Paper Review] Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability](/assets/images/blog/20260201-paper-2601-18778-teaching-models-to-teach-thems.jpg)
[Paper Review] Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability
Can a model learn to escape its own learning plateau? Reinforcement learning methods for finetuning large reasoning models stall on datasets with low initial success rates, and thus little training si...
![[Paper Review] On First-Order Meta-Learning Algorithms](/assets/images/blog/20260201-paper-1803-02999-on-first-order-meta-learning-a.jpg)
[Paper Review] On First-Order Meta-Learning Algorithms
This paper considers meta-learning problems, where there is a distribution of tasks, and we would like to obtain an agent that performs well (i.e., learns quickly) when presented with a previously uns...
![[Paper Review] From Seed AI to Technological Singularity via Recursively Self-Improving Software](/assets/images/blog/20260201-paper-1502-06512-from-seed-ai-to-technological-.jpg)
[Paper Review] From Seed AI to Technological Singularity via Recursively Self-Improving Software
Software capable of improving itself has been a dream of computer scientists since the inception of the field. In this work we provide definitions for Recursively Self-Improving software, survey diffe...
![[Paper Review] Resonant Sparse Geometry Networks](/assets/images/blog/20260131-paper-2601-18064-resonant-sparse-geometry-netwo.jpg)
[Paper Review] Resonant Sparse Geometry Networks
We introduce Resonant Sparse Geometry Networks (RSGN), a brain-inspired architecture with self-organizing sparse hierarchical input-dependent connectivity. Unlike Transformer architectures that employ...
![[Paper Review] Alignment Pretraining: AI Discourse Causes Self-Fulfilling (Mis)alignment](/assets/images/blog/20260130-paper-2601-10160-alignment-pretraining-ai-disco.jpg)
[Paper Review] Alignment Pretraining: AI Discourse Causes Self-Fulfilling (Mis)alignment
Pretraining corpora contain extensive discourse about AI systems, yet the causal influence of this discourse on downstream alignment remains poorly understood. If prevailing descriptions of AI behavio...
![[Paper Review] LLM-in-Sandbox Elicits General Agentic Intelligence](/assets/images/blog/20260129-paper-2601-16206-llm-in-sandbox-elicits-general.jpg)
[Paper Review] LLM-in-Sandbox Elicits General Agentic Intelligence
We introduce LLM-in-Sandbox, enabling LLMs to explore within a code sandbox (i.e., a virtual computer), to elicit general intelligence in non-code domains. We first demonstrate that strong LLMs, witho...
![[Paper Review] Large Language Model Agent: A Survey on Methodology, Applications and Challenges](/assets/images/blog/20260128-paper-2503-21460-large-language-model-agent-a-s.jpg)
[Paper Review] Large Language Model Agent: A Survey on Methodology, Applications and Challenges
The era of intelligent agents is upon us, driven by revolutionary advancements in large language models. Large Language Model (LLM) agents, with goal-driven behaviors and dynamic adaptation capabiliti...
![[Paper Review] Aligning Large Language Models to a Domain-specific Graph Database for NL2GQL](/assets/images/blog/20260128-paper-2402-16567-aligning-large-language-models.jpg)
[Paper Review] Aligning Large Language Models to a Domain-specific Graph Database for NL2GQL
Graph Databases (Graph DB) find extensive application across diverse domains such as finance, social networks, and medicine. Yet, the translation of Natural Language (NL) into the Graph Query Language...

TabPFN: A New Paradigm in Data Science
Today, data science and artificial intelligence (AI) are driving innovation across a wide range of industries and academic fields. Amid this change, data modeling and prediction play a crucial role, and techniques for handling tabular data are attracting particular attention. This post introduces an innovative approach called TabPFN and how this technology can contribute to the future of data science...
![[Paper Review] Agentic Reasoning for Large Language Models](/assets/images/blog/20260125-paper-2601-12538-agentic-reasoning.jpg)
[Paper Review] Agentic Reasoning for Large Language Models
Reasoning is a fundamental cognitive process underlying inference, problem-solving, and decision-making. While large language models (LLMs) demonstrate strong reasoning capabilities in closed-world se...

How to Improve the Performance of Data Science Projects: tuneTable
In data science and artificial intelligence (AI) projects, optimizing model performance is one of the biggest challenges. Whichever algorithm you use, finding the right hyperparameters can make or break a model. This post introduces tuneTable, a tool that can help solve such optimization problems...
![[Paper Review] Learning to Discover at Test Time](/assets/images/blog/20260123-paper-2601-16175-learning-to-discover-at-test-t.jpg)
[Paper Review] Learning to Discover at Test Time
How can we use AI to discover a new state of the art for a scientific problem? Prior work in test-time scaling, such as AlphaEvolve, performs search by prompting a frozen LLM. We perform reinforcement...
![[Paper Review] STEM: Scaling Transformers with Embedding Modules](/assets/images/blog/20260123-paper-2601-10639-stem-scaling-transformers-with.jpg)
[Paper Review] STEM: Scaling Transformers with Embedding Modules
Fine-grained sparsity promises higher parametric capacity without proportional per-token compute, but often suffers from training instability, load balancing, and communication overhead. We introduce ...
![[Paper Review] Aligning Agentic World Models via Knowledgeable Experience Learning](/assets/images/blog/20260122-paper-2601-13247-aligning-agentic-world-models-.jpg)
[Paper Review] Aligning Agentic World Models via Knowledgeable Experience Learning
Current Large Language Models (LLMs) exhibit a critical modal disconnect: they possess vast semantic knowledge but lack the procedural grounding to respect the immutable laws of the physical world. Co...
![[Paper Review] The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models](/assets/images/blog/20260122-paper-2601-10387-the-assistant-axis-situating-a.jpg)
[Paper Review] The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
Large language models can represent a variety of personas but typically default to a helpful Assistant identity cultivated during post-training. We investigate the structure of the space of model pers...
![[Paper Review] Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge](/assets/images/blog/20260122-paper-2601-08808-multiplex-thinking-reasoning-v.jpg)
[Paper Review] Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge
Large language models often solve complex reasoning tasks more effectively with Chain-of-Thought (CoT), but at the cost of long, low-bandwidth token sequences. Humans, by contrast, often reason softly...
![[Paper Review] MetaEmbed: Scaling Multimodal Retrieval at Test-Time with Flexible Late Interaction](/assets/images/blog/20260122-paper-2509-18095-metaembed-scaling-multimodal-r.jpg)
[Paper Review] MetaEmbed: Scaling Multimodal Retrieval at Test-Time with Flexible Late Interaction
Universal multimodal embedding models have achieved great success in capturing semantic relevance between queries and candidates. However, current methods either condense queries and candidates into a...
![[Paper Review] SAGE: Tool-Augmented LLM Task Solving Strategies in Scalable Multi-Agent Environments](/assets/images/blog/20260121-paper-2601-09750-sage-tool-augmented-llm-task-s.jpg)
[Paper Review] SAGE: Tool-Augmented LLM Task Solving Strategies in Scalable Multi-Agent Environments
Large language models (LLMs) have proven to work well in question-answering scenarios, but real-world applications often require access to tools for live information or actuation. For this, LLMs can b...
![[Paper Review] Urban Socio-Semantic Segmentation with Vision-Language Reasoning](/assets/images/blog/20260119-paper-2601-10477-urban-socio-semantic-segmentat.jpg)
[Paper Review] Urban Socio-Semantic Segmentation with Vision-Language Reasoning
As hubs of human activity, urban surfaces consist of a wealth of semantic entities. Segmenting these various entities from satellite imagery is crucial for a range of downstream applications. Current ...
![[Paper Review] KGGen: Extracting Knowledge Graphs from Plain Text with Language Models](/assets/images/blog/20260118-paper-2502-09956-kggen-extracting-knowledge-gra.jpg)
[Paper Review] KGGen: Extracting Knowledge Graphs from Plain Text with Language Models
Recent interest in building foundation models for KGs has highlighted a fundamental challenge: knowledge-graph data is relatively scarce. The best-known KGs are primarily human-labeled, created by pat...
![[Paper Review] Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for Large Language Model Agents](/assets/images/blog/20260115-paper-2601-01885-agentic-memory-learning-unifie.jpg)
[Paper Review] Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for Large Language Model Agents
Large language model (LLM) agents face fundamental limitations in long-horizon reasoning due to finite context windows, making effective memory management critical. Existing methods typically handle l...
![[Paper Review] Tracing Moral Foundations in Large Language Models](/assets/images/blog/20260113-paper-2601-05437-tracing-moral-foundations-in-l.jpg)
[Paper Review] Tracing Moral Foundations in Large Language Models
Large language models (LLMs) often produce human-like moral judgments, but it is unclear whether this reflects an internal conceptual structure or superficial "moral mimicry." Using Moral Foundation...
![[Paper Review] From Entropy to Epiplexity: Rethinking Information for Computationally Bounded Intelligence](/assets/images/blog/20260113-paper-2601-03220-from-entropy-to-epiplexity-ret.jpg)
[Paper Review] From Entropy to Epiplexity: Rethinking Information for Computationally Bounded Intelligence
Can we learn more from data than existed in the generating process itself? Can new and useful information be constructed from merely applying deterministic transformations to existing data? Can the le...
![[Paper Review] Token-Level LLM Collaboration via FusionRoute](/assets/images/blog/20260112-paper-2601-05106-token-level-llm-collaboration-.jpg)
[Paper Review] Token-Level LLM Collaboration via FusionRoute
Large language models (LLMs) exhibit strengths across diverse domains. However, achieving strong performance across these domains with a single general-purpose model typically requires scaling to size...
![[Paper Review] GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization](/assets/images/blog/20260111-paper-2601-05242-gdpo-group-reward-decoupled-no.jpg)
[Paper Review] GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
As language models become increasingly capable, users expect them to provide not only accurate responses but also behaviors aligned with diverse human preferences across a variety of scenarios. To ach...
![[Paper Review] Learning Latent Action World Models In The Wild](/assets/images/blog/20260111-paper-2601-05230-learning-latent-action-world-m.jpg)
[Paper Review] Learning Latent Action World Models In The Wild
Agents capable of reasoning and planning in the real world require the ability to predict the consequences of their actions. While world models possess this capability, they most often require acti...
![[Paper Review] Jenius Agent: Towards Experience-Driven Accuracy Optimization in Real-World Scenarios](/assets/images/blog/20260111-paper-2601-01857-jenius-agent-towards-experienc.jpg)
[Paper Review] Jenius Agent: Towards Experience-Driven Accuracy Optimization in Real-World Scenarios
As agent systems powered by large language models (LLMs) advance, improving the task performance of an autonomous agent, especially in context understanding, tool usage, and response generation, has b...
![[Paper Review] End-to-End Test-Time Training for Long Context](/assets/images/blog/20260111-paper-2512-23675-end-to-end-test-time-training-.jpg)
[Paper Review] End-to-End Test-Time Training for Long Context
We formulate long-context language modeling as a problem in continual learning rather than architecture design. Under this formulation, we only use a standard architecture -- a Transformer with slidin...
![[Paper Review] The Missing Layer of AGI: From Pattern Alchemy to Coordination Physics](/assets/images/blog/20260111-paper-2512-05765-the-missing-layer-of-agi-from-.jpg)
[Paper Review] The Missing Layer of AGI: From Pattern Alchemy to Coordination Physics
Influential critiques argue that Large Language Models (LLMs) are a dead end for AGI: "mere pattern matchers" structurally incapable of reasoning or planning. We argue this conclusion misidentifies th...
![[Paper Review] Confucius Code Agent: Scalable Agent Scaffolding for Real-World Codebases](/assets/images/blog/20260110-paper-2512-10398-confucius-code-agent-scalable-.jpg)
[Paper Review] Confucius Code Agent: Scalable Agent Scaffolding for Real-World Codebases
Real-world software engineering tasks require coding agents that can operate over massive repositories, sustain long-horizon sessions, and reliably coordinate complex toolchains at test time. Existing...
![[Paper Review] RelayLLM: Efficient Reasoning via Collaborative Decoding](/assets/images/blog/20260109-paper-2601-05167-relayllm-efficient-reasoning-v.jpg)
[Paper Review] RelayLLM: Efficient Reasoning via Collaborative Decoding
Using Large Language Models (LLMs) for complex reasoning is often hindered by high computational costs and latency, while resource-efficient Small Language Models (SLMs) typically lack the necessary reasoni...
![[Paper Review] AT²PO: Agentic Turn-based Policy Optimization via Tree Search](/assets/images/blog/20260109-paper-2601-04767-at-2-po-agentic-turn-based-pol.jpg)
[Paper Review] AT²PO: Agentic Turn-based Policy Optimization via Tree Search
LLM agents have emerged as powerful systems for tackling multi-turn tasks by interleaving internal reasoning and external tool interactions. Agentic Reinforcement Learning has recently drawn significa...
![[Paper Review] 1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities](/assets/images/blog/20260109-paper-2503-14858-1000-layer-networks-for-self-s.jpg)
[Paper Review] 1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities
Scaling up self-supervised learning has driven breakthroughs in language and vision, yet comparable progress has remained elusive in reinforcement learning (RL). In this paper, we study building block...
![[Paper Review] Extracting books from production language models](/assets/images/blog/20260108-paper-2601-02671-extracting-books-from-producti.jpg)
[Paper Review] Extracting books from production language models
Many unresolved legal questions over LLMs and copyright center on memorization: whether specific training data have been encoded in the model's weights during training, and whether those memorized dat...
![[Paper Review] The Platonic Representation Hypothesis](/assets/images/blog/20260106-paper-2405-07987-the-platonic-representation-hy.jpg)
[Paper Review] The Platonic Representation Hypothesis
We argue that representations in AI models, particularly deep networks, are converging. First, we survey many examples of convergence in the literature: over time and across multiple domains, the ways...
![[논문 리뷰] Deep Delta Learning](/assets/images/blog/20260104-paper-url-pdf-deep-delta-learning.jpg)
[논문 리뷰] Deep Delta Learning
The efficacy of deep residual networks is fundamentally predicated on the identity shortcut connection. While this mechanism effectively mitigates the vanishing gradient problem, it imposes a strictly...
![[논문 리뷰] SeedFold: Scaling Biomolecular Structure Prediction](/assets/images/blog/20260104-paper-2512-24354-seedfold-scaling-biomolecular-.jpg)
[논문 리뷰] SeedFold: Scaling Biomolecular Structure Prediction
Highly accurate biomolecular structure prediction is a key component of developing biomolecular foundation models, and one of the most critical aspects of building foundation models is identifying the...
![[논문 리뷰] CogRec: A Cognitive Recommender Agent Fusing Large Language Models and Soar for Explainable Recommendation](/assets/images/blog/20260104-paper-2512-24113-cogrec-a-cognitive-recommender.jpg)
[논문 리뷰] CogRec: A Cognitive Recommender Agent Fusing Large Language Models and Soar for Explainable Recommendation
Large Language Models (LLMs) have demonstrated a remarkable capacity in understanding user preferences for recommendation systems. However, they are constrained by several critical challenges, includi...
![[논문 리뷰] Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling](/assets/images/blog/20260104-paper-2512-23959-improving-multi-step-rag-with-.jpg)
[논문 리뷰] Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling
Multi-step retrieval-augmented generation (RAG) has become a widely adopted strategy for enhancing large language models (LLMs) on tasks that demand global comprehension and intensive reasoning. Many ...
![[논문 리뷰] Training AI Co-Scientists Using Rubric Rewards](/assets/images/blog/20260104-paper-2512-23707-training-ai-co-scientists-usin.jpg)
[논문 리뷰] Training AI Co-Scientists Using Rubric Rewards
AI co-scientists are emerging as a tool to assist human researchers in achieving their research goals. A crucial feature of these AI co-scientists is the ability to generate a research plan given a se...
![[논문 리뷰] SPIRAL: Symbolic LLM Planning via Grounded and Reflective Search](/assets/images/blog/20260104-paper-2512-23167-spiral-symbolic-llm-planning-v.jpg)
[논문 리뷰] SPIRAL: Symbolic LLM Planning via Grounded and Reflective Search
Large Language Models (LLMs) often falter at complex planning tasks that require exploration and self-correction, as their linear reasoning process struggles to recover from early mistakes. While sear...
![[논문 리뷰] Attention Is Not What You Need](/assets/images/blog/20260104-paper-2512-19428-attention-is-not-what-you-need.jpg)
[논문 리뷰] Attention Is Not What You Need
We revisit a basic question in sequence modeling: is explicit self-attention actually necessary for strong performance and reasoning? We argue that standard multi-head attention is best seen as a form...
![[논문 리뷰] Scaling and context steer LLMs along the same computational path as the human brain](/assets/images/blog/20260104-paper-2512-01591-scaling-and-context-steer-llms.jpg)
[논문 리뷰] Scaling and context steer LLMs along the same computational path as the human brain
Recent studies suggest that the representations learned by large language models (LLMs) are partially aligned to those of the human brain. However, whether and why this alignment score arises from a s...
![[논문 리뷰] Zero-Overhead Introspection for Adaptive Test-Time Compute](/assets/images/blog/20260104-paper-2512-01457-zero-overhead-introspection-fo.jpg)
[논문 리뷰] Zero-Overhead Introspection for Adaptive Test-Time Compute
Large language models excel at reasoning but lack key aspects of introspection, including anticipating their own success and the computation required to achieve it. Humans use real-time introspection ...
![[논문 리뷰] A Survey on Large Language Models for Mathematical Reasoning](/assets/images/blog/20260104-paper-2506-08446-a-survey-on-large-language-mod.jpg)
[논문 리뷰] A Survey on Large Language Models for Mathematical Reasoning
Mathematical reasoning has long represented one of the most fundamental and challenging frontiers in artificial intelligence research. In recent years, large language models (LLMs) have achieved signi...
![[논문 리뷰] Aligning machine and human visual representations across abstraction levels](/assets/images/blog/20260103-paper-url-pdf-aligning-machine-and-human-vis.jpg)
[논문 리뷰] Aligning machine and human visual representations across abstraction levels
...
![[논문 리뷰] Completed Hyperparameter Transfer across Modules, Width, Depth, Batch and Duration](/assets/images/blog/20260103-paper-2512-22382-completed-hyperparameter-trans.jpg)
[논문 리뷰] Completed Hyperparameter Transfer across Modules, Width, Depth, Batch and Duration
Hyperparameter tuning can dramatically impact training stability and final performance of large-scale models. Recent works on neural network parameterisations, such as μP, have enabled transfer of o...
![[논문 리뷰] Shape of Thought: When Distribution Matters More than Correctness in Reasoning Tasks](/assets/images/blog/20260103-paper-2512-22255-shape-of-thought-when-distribu.jpg)
[논문 리뷰] Shape of Thought: When Distribution Matters More than Correctness in Reasoning Tasks
We present the surprising finding that a language model's reasoning capabilities can be improved by training on synthetic datasets of chain-of-thought (CoT) traces from more capable models, even when ...
![[논문 리뷰] SemanticGen: Video Generation in Semantic Space](/assets/images/blog/20260103-paper-2512-20619-semanticgen-video-generation-i.jpg)
[논문 리뷰] SemanticGen: Video Generation in Semantic Space
State-of-the-art video generative models typically learn the distribution of video latents in the VAE space and map them to pixels using a VAE decoder. While this approach can generate high-quality vi...
![[논문 리뷰] The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding](/assets/images/blog/20260103-paper-2512-19693-the-prism-hypothesis-harmonizi.jpg)
[논문 리뷰] The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding
Deep representations across modalities are inherently intertwined. In this paper, we systematically analyze the spectral characteristics of various semantic and pixel encoders. Interestingly, our stud...
![[논문 리뷰] Epistemological Fault Lines Between Human and Artificial Intelligence](/assets/images/blog/20260103-paper-2512-19466-epistemological-fault-lines-be.jpg)
[논문 리뷰] Epistemological Fault Lines Between Human and Artificial Intelligence
Large language models (LLMs) are widely described as artificial intelligence, yet their epistemic profile diverges sharply from human cognition. Here we show that the apparent alignment between human ...
![[논문 리뷰] Learning Hierarchical Procedural Memory for LLM Agents through Bayesian Selection and Contrastive Refinement](/assets/images/blog/20260103-paper-2512-18950-learning-hierarchical-procedur.jpg)
[논문 리뷰] Learning Hierarchical Procedural Memory for LLM Agents through Bayesian Selection and Contrastive Refinement
We present MACLA, a framework that decouples reasoning from learning by maintaining a frozen large language model while performing all adaptation in an external hierarchical procedural memory. MACLA e...
![[논문 리뷰] Toward Training Superintelligent Software Agents through Self-Play SWE-RL](/assets/images/blog/20260103-paper-2512-18552-toward-training-superintellige.jpg)
[논문 리뷰] Toward Training Superintelligent Software Agents through Self-Play SWE-RL
While current software agents powered by large language models (LLMs) and agentic reinforcement learning (RL) can boost programmer productivity, their training data (e.g., GitHub issues and pull reque...
![[논문 리뷰] Sophia: A Persistent Agent Framework of Artificial Life](/assets/images/blog/20260103-paper-2512-18202-sophia-a-persistent-agent-fram.jpg)
[논문 리뷰] Sophia: A Persistent Agent Framework of Artificial Life
The development of LLMs has elevated AI agents from task-specific tools to long-lived, decision-making entities. Yet, most architectures remain static and reactive, tethered to manually defined, narro...
![[논문 리뷰] Distributional AGI Safety](/assets/images/blog/20260103-paper-2512-16856-distributional-agi-safety.jpg)
[논문 리뷰] Distributional AGI Safety
AI safety and alignment research has predominantly been focused on methods for safeguarding individual AI systems, resting on the assumption of an eventual emergence of a monolithic Artificial General...
![[논문 리뷰] LLaDA2.0: Scaling Up Diffusion Language Models to 100B](/assets/images/blog/20260103-paper-2512-15745-llada2-0-scaling-up-diffusion-.jpg)
[논문 리뷰] LLaDA2.0: Scaling Up Diffusion Language Models to 100B
This paper presents LLaDA2.0 -- a tuple of discrete diffusion large language models (dLLM) scaling up to 100B total parameters through systematic conversion from auto-regressive (AR) models -- establi...
![[논문 리뷰] ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding](/assets/images/blog/20260103-paper-2512-13586-refusion-a-diffusion-large-lan.jpg)
[논문 리뷰] ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding
Autoregressive models (ARMs) are hindered by slow sequential inference. While masked diffusion models (MDMs) offer a parallel alternative, they suffer from critical drawbacks: high computational overh...
![[논문 리뷰] Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model](/assets/images/blog/20260103-paper-2512-13507-seedance-1-5-pro-a-native-audi.jpg)
[논문 리뷰] Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model
Recent strides in video generation have paved the way for unified audio-visual generation. In this work, we present Seedance 1.5 pro, a foundational model engineered specifically for native, joint aud...
![[논문 리뷰] An Anatomy of Vision-Language-Action Models: From Modules to Milestones and Challenges](/assets/images/blog/20260103-paper-2512-11362-an-anatomy-of-vision-language-.jpg)
[논문 리뷰] An Anatomy of Vision-Language-Action Models: From Modules to Milestones and Challenges
Vision-Language-Action (VLA) models are driving a revolution in robotics, enabling machines to understand instructions and interact with the physical world. This field is exploding with new models and...
![[논문 리뷰] Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer](/assets/images/blog/20260103-paper-2511-22699-z-image-an-efficient-image-gen.jpg)
[논문 리뷰] Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
The landscape of high-performance image generation models is currently dominated by proprietary systems, such as Nano Banana Pro and Seedream 4.0. Leading open-source alternatives, including Qwen-Imag...
![[논문 리뷰] ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration](/assets/images/blog/20260103-paper-2511-21689-toolorchestra-elevating-intell.jpg)
[논문 리뷰] ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
Large language models are powerful generalists, yet solving deep and complex problems such as those of the Humanity's Last Exam (HLE) remains both conceptually challenging and computationally expensiv...
![[논문 리뷰] Matrix: Peer-to-Peer Multi-Agent Synthetic Data Generation Framework](/assets/images/blog/20260103-paper-2511-21686-matrix-peer-to-peer-multi-agen.jpg)
[논문 리뷰] Matrix: Peer-to-Peer Multi-Agent Synthetic Data Generation Framework
Synthetic data has become increasingly important for training large language models, especially when real data is scarce, expensive, or privacy-sensitive. Many such generation tasks require coordinate...
![[논문 리뷰] Flow Map Distillation Without Data](/assets/images/blog/20260103-paper-2511-19428-flow-map-distillation-without-.jpg)
[논문 리뷰] Flow Map Distillation Without Data
State-of-the-art flow models achieve remarkable quality but require slow, iterative sampling. To accelerate this, flow maps can be distilled from pre-trained teachers, a procedure that conventionally ...
![[논문 리뷰] Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens](/assets/images/blog/20260103-paper-2511-19418-chain-of-visual-thought-teachi.jpg)
[논문 리뷰] Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens
Vision-Language Models (VLMs) excel at reasoning in linguistic space but struggle with perceptual understanding that requires dense visual perception, e.g., spatial reasoning and geometric awareness. ...
![[논문 리뷰] CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning](/assets/images/blog/20260103-paper-2511-18659-clara-bridging-retrieval-and-g.jpg)
[논문 리뷰] CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning
Retrieval-augmented generation (RAG) enhances large language models (LLMs) with external knowledge but still suffers from long contexts and disjoint retrieval-generation optimization. In this work, we...
![[논문 리뷰] OmniScientist: Toward a Co-evolving Ecosystem of Human and AI Scientists](/assets/images/blog/20260103-paper-2511-16931-omniscientist-toward-a-co-evol.jpg)
[논문 리뷰] OmniScientist: Toward a Co-evolving Ecosystem of Human and AI Scientists
With the rapid development of Large Language Models (LLMs), AI agents have demonstrated increasing proficiency in scientific tasks, ranging from hypothesis generation and experimental design to manusc...
![[논문 리뷰] Evolution Strategies at the Hyperscale](/assets/images/blog/20260103-paper-2511-16652-evolution-strategies-at-the-hy.jpg)
[논문 리뷰] Evolution Strategies at the Hyperscale
We introduce Evolution Guided General Optimization via Low-rank Learning (EGGROLL), an evolution strategies (ES) algorithm designed to scale backprop-free optimization to large population sizes for mo...
![[논문 리뷰] A Primer on Quantum Machine Learning](/assets/images/blog/20260103-paper-2511-15969-a-primer-on-quantum-machine-le.jpg)
[논문 리뷰] A Primer on Quantum Machine Learning
Quantum machine learning (QML) is a computational paradigm that seeks to apply quantum-mechanical resources to solve learning problems. As such, the goal of this framework is to leverage quantum proce...
![[논문 리뷰] Fine-Tuned LLMs Know They Don't Know: A Parameter-Efficient Approach to Recovering Honesty](/assets/images/blog/20260103-paper-2511-12991-fine-tuned-llms-know-they-don-.jpg)
[논문 리뷰] Fine-Tuned LLMs Know They Don't Know: A Parameter-Efficient Approach to Recovering Honesty
The honesty of Large Language Models (LLMs) is increasingly important for safe deployment in high-stakes domains. However, this crucial trait is severely undermined by supervised fine-tuning (SFT), a ...
![[논문 리뷰] Black-Box On-Policy Distillation of Large Language Models](/assets/images/blog/20260103-paper-2511-10643-black-box-on-policy-distillati.jpg)
[논문 리뷰] Black-Box On-Policy Distillation of Large Language Models
Black-box distillation creates student large language models (LLMs) by learning from a proprietary teacher model's text outputs alone, without access to its internal logits or parameters. In this work...
![[논문 리뷰] DoPE: Denoising Rotary Position Embedding](/assets/images/blog/20260103-paper-2511-09146-dope-denoising-rotary-position.jpg)
[논문 리뷰] DoPE: Denoising Rotary Position Embedding
Rotary Position Embedding (RoPE) in Transformer models has inherent limits that weaken length extrapolation. We reinterpret the attention map with positional encoding as a noisy feature map, and propo...
![[논문 리뷰] Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models](/assets/images/blog/20260103-paper-2511-08577-think-at-hard-selective-latent.jpg)
[논문 리뷰] Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models
Improving reasoning capabilities of Large Language Models (LLMs), especially under parameter constraints, is crucial for real-world applications. Prior work proposes recurrent transformers, which allo...
![[논문 리뷰] Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B](/assets/images/blog/20260103-paper-2511-06221-tiny-model-big-logic-diversity.jpg)
[논문 리뷰] Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B
Challenging the prevailing consensus that small models inherently lack robust reasoning, this report introduces VibeThinker-1.5B, a 1.5B-parameter dense model developed via our Spectrum-to-Signal Prin...
![[논문 리뷰] World Simulation with Video Foundation Models for Physical AI](/assets/images/blog/20260103-paper-2511-00062-world-simulation-with-video-fo.jpg)
[논문 리뷰] World Simulation with Video Foundation Models for Physical AI
We introduce Cosmos-Predict2.5, the latest generation of the Cosmos World Foundation Models for Physical AI. Built on a flow-based architecture, Cosmos-Predict2.5 unifies Text2World, Image2World, ...
![[논문 리뷰] The Era of Agentic Organization: Learning to Organize with Language Models](/assets/images/blog/20260103-paper-2510-26658-the-era-of-agentic-organizatio.jpg)
[논문 리뷰] The Era of Agentic Organization: Learning to Organize with Language Models
We envision a new era of AI, termed agentic organization, where agents solve complex problems by working collaboratively and concurrently, enabling outcomes beyond individual intelligence. To realize ...
![[논문 리뷰] Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision](/assets/images/blog/20260103-paper-2509-14234-compute-as-teacher-turning-inf.jpg)
[논문 리뷰] Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision
Where do learning signals come from when there is no ground truth in post-training? We propose turning exploration into supervision through Compute as Teacher (CaT), which converts the model's own exp...
![[논문 리뷰] Decoupling the "What" and "Where" With Polar Coordinate Positional Embeddings](/assets/images/blog/20260103-paper-2509-10534-decoupling-the-what-and-where-.jpg)
[논문 리뷰] Decoupling the "What" and "Where" With Polar Coordinate Positional Embeddings
The attention mechanism in a Transformer architecture matches key to query based on both content -- the what -- and position in a sequence -- the where. We present an analysis indicating that what and...
![[논문 리뷰] Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning](/assets/images/blog/20260103-paper-2509-03646-emergent-hierarchical-reasonin.jpg)
[논문 리뷰] Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning
Reinforcement Learning (RL) has proven highly effective at enhancing the complex reasoning abilities of Large Language Models (LLMs), yet the underlying mechanisms driving this success remain largely opaq...
![[논문 리뷰] From AI for Science to Agentic Science: A Survey on Autonomous Scientific Discovery](/assets/images/blog/20260103-paper-2508-14111-from-ai-for-science-to-agentic.jpg)
[논문 리뷰] From AI for Science to Agentic Science: A Survey on Autonomous Scientific Discovery
Artificial intelligence (AI) is reshaping scientific discovery, evolving from specialized computational tools into autonomous research partners. We position Agentic Science as a pivotal stage within t...
![[논문 리뷰] Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2](/assets/images/blog/20260103-paper-2502-03544-gold-medalist-performance-in-s.jpg)
[논문 리뷰] Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2
We present AlphaGeometry2 (AG2), a significantly improved version of AlphaGeometry introduced in (Trinh et al., 2024), which has now surpassed an average gold medalist in solving Olympiad geometry pro...
![[논문 리뷰] Nested Learning: The Illusion of Deep Learning Architecture](/assets/images/blog/20260102-paper-url-pdf-nested-learning-the-illusion-o.jpg)
[논문 리뷰] Nested Learning: The Illusion of Deep Learning Architecture
Over the last decades, developing more powerful neural architectures and simultaneously designing optimization algorithms to effectively train them have been the core of research efforts to enhance th...
![[논문 리뷰] Act2Goal: From World Model To General Goal-conditioned Policy](/assets/images/blog/20260102-paper-2512-23541-act2goal-from-world-model-to-g.jpg)
[논문 리뷰] Act2Goal: From World Model To General Goal-conditioned Policy
Specifying robotic manipulation tasks in a manner that is both expressive and precise remains a central challenge. While visual goals provide a compact and unambiguous task specification, existing goa...
![[논문 리뷰] Pruning as a Game: Equilibrium-Driven Sparsification of Neural Networks](/assets/images/blog/20260102-paper-2512-22106-pruning-as-a-game-equilibrium-.jpg)
[논문 리뷰] Pruning as a Game: Equilibrium-Driven Sparsification of Neural Networks
Neural network pruning is widely used to reduce model size and computational cost. Yet, most existing methods treat sparsity as an externally imposed constraint, enforced through heuristic importance ...
![[논문 리뷰] AgentEvolver: Towards Efficient Self-Evolving Agent System](/assets/images/blog/20260102-paper-2511-10395-agentevolver-towards-efficient.jpg)
[논문 리뷰] AgentEvolver: Towards Efficient Self-Evolving Agent System
Autonomous agents powered by large language models (LLMs) have the potential to significantly enhance human productivity by reasoning, using tools, and executing complex tasks in diverse environments....
![[논문 리뷰] TiDAR: Think in Diffusion, Talk in Autoregression](/assets/images/blog/20260102-paper-2511-08923-tidar-think-in-diffusion-talk-.jpg)
[논문 리뷰] TiDAR: Think in Diffusion, Talk in Autoregression
Diffusion language models hold the promise of fast parallel generation, while autoregressive (AR) models typically excel in quality due to their causal structure aligning naturally with language model...
![[논문 리뷰] AlphaResearch: Accelerating New Algorithm Discovery with Language Models](/assets/images/blog/20260102-paper-2511-08522-alpharesearch-accelerating-new.jpg)
[논문 리뷰] AlphaResearch: Accelerating New Algorithm Discovery with Language Models
Large language models have made significant progress in complex but easy-to-verify problems, yet they still struggle with discovering the unknown. In this paper, we present AlphaResearch, an ...
![[논문 리뷰] Attention and Compression is all you need for Controllably Efficient Language Models](/assets/images/blog/20260102-paper-2511-05313-attention-and-compression-is-a.jpg)
[논문 리뷰] Attention and Compression is all you need for Controllably Efficient Language Models
The quadratic cost of attention in transformers motivated the development of efficient approaches: namely sparse and sliding window attention, convolutions and linear attention. Although these approac...
![[논문 리뷰] The Curved Spacetime of Transformer Architectures](/assets/images/blog/20260102-paper-2511-03060-the-curved-spacetime-of-transf.jpg)
[논문 리뷰] The Curved Spacetime of Transformer Architectures
We present a geometric framework for understanding Transformer-based language models, drawing an explicit analogy to General Relativity. Queries and keys induce an effective metric on representation s...
![[논문 리뷰] No-Human in the Loop: Agentic Evaluation at Scale for Recommendation](/assets/images/blog/20260102-paper-2511-03051-no-human-in-the-loop-agentic-e.jpg)
[논문 리뷰] No-Human in the Loop: Agentic Evaluation at Scale for Recommendation
Evaluating large language models (LLMs) as judges is increasingly critical for building scalable and trustworthy evaluation pipelines. We present ScalingEval, a large-scale benchmarking study that sys...
![[논문 리뷰] ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning](/assets/images/blog/20260102-paper-2510-27492-thinkmorph-emergent-properties.jpg)
[논문 리뷰] ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning
Multimodal reasoning requires iterative coordination between language and vision, yet it remains unclear what constitutes a meaningful interleaved chain of thought. We posit that text and image though...
![[논문 리뷰] Context Engineering 2.0: The Context of Context Engineering](/assets/images/blog/20260102-paper-2510-26493-context-engineering-2-0-the-co.jpg)
[논문 리뷰] Context Engineering 2.0: The Context of Context Engineering
Karl Marx once wrote that "the human essence is the ensemble of social relations", suggesting that individuals are not isolated entities but are fundamentally shaped by their interactions with other...
![[논문 리뷰] Chain-of-Thought Hijacking](/assets/images/blog/20260102-paper-2510-26418-chain-of-thought-hijacking.jpg)
[논문 리뷰] Chain-of-Thought Hijacking
Large reasoning models (LRMs) achieve higher task performance with more inference-time computation, and prior works suggest this scaled reasoning may also strengthen safety by improving refusal. Yet w...
![[논문 리뷰] GAP: Graph-Based Agent Planning with Parallel Tool Use and Reinforcement Learning](/assets/images/blog/20260102-paper-2510-25320-gap-graph-based-agent-planning.jpg)
[논문 리뷰] GAP: Graph-Based Agent Planning with Parallel Tool Use and Reinforcement Learning
Autonomous agents powered by large language models (LLMs) have shown impressive capabilities in tool manipulation for complex task-solving. However, existing paradigms such as ReAct rely on sequential...
![[논문 리뷰] Reasoning with Sampling: Your Base Model is Smarter Than You Think](/assets/images/blog/20260102-paper-2510-14901-reasoning-with-sampling-your-b.jpg)
[논문 리뷰] Reasoning with Sampling: Your Base Model is Smarter Than You Think
Frontier reasoning models have exhibited incredible capabilities across a wide array of disciplines, driven by posttraining large language models (LLMs) with reinforcement learning (RL). However, desp...
![[논문 리뷰] Cache-to-Cache: Direct Semantic Communication Between Large Language Models](/assets/images/blog/20260102-paper-2510-03215-cache-to-cache-direct-semantic.jpg)
[논문 리뷰] Cache-to-Cache: Direct Semantic Communication Between Large Language Models
Multi-LLM systems harness the complementary strengths of diverse Large Language Models, achieving performance and efficiency gains unattainable by a single model. In existing designs, LLMs communicate...
![[논문 리뷰] Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks](/assets/images/blog/20260102-paper-2503-09572-plan-and-act-improving-plannin.jpg)
[논문 리뷰] Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks
Large language models (LLMs) have shown remarkable advancements in enabling language agents to tackle simple tasks. However, applying them for complex, multi-step, long-horizon tasks remains a challen...

The Importance of Testing and How to Implement It
There is probably no developer who has gone through software development without hearing the word "testing". Testing plays an essential role in guaranteeing the reliability and stability of software, and it matters even more in data science and artificial intelligence, where the correctness of results and the performance of models must be verified. In this post we look at why testing matters and how to run tests efficiently...
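As a concrete taste of the topic, here is a minimal sketch of a unit test written with Python's standard `unittest` module. The function under test, `normalize_scores`, and its rules are illustrative examples, not code from the post.

```python
# Minimal sketch: unit-testing an illustrative data helper with unittest.

import unittest


def normalize_scores(scores):
    """Scale a list of non-negative numbers so the maximum becomes 1.0."""
    if not scores:
        return []
    if min(scores) < 0:
        raise ValueError("scores must be non-negative")
    peak = max(scores)
    return [s / peak for s in scores] if peak else [0.0] * len(scores)


class TestNormalizeScores(unittest.TestCase):
    def test_scales_to_unit_interval(self):
        self.assertEqual(normalize_scores([2, 4, 8]), [0.25, 0.5, 1.0])

    def test_empty_input(self):
        self.assertEqual(normalize_scores([]), [])

    def test_rejects_negative_values(self):
        with self.assertRaises(ValueError):
            normalize_scores([-1, 3])


# Run the suite programmatically (instead of unittest.main()).
suite = unittest.defaultTestLoader.loadTestsFromTestCase(TestNormalizeScores)
result = unittest.TextTestRunner(verbosity=2).run(suite)
```

Each `test_*` method checks one behavior, including the error path — covering failure modes is what makes regressions visible early.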
![[논문 리뷰] mHC: Manifold-Constrained Hyper-Connections](/assets/images/blog/20260101-paper-2512-24880-mhc-manifold-constrained-hyper.jpg)
[논문 리뷰] mHC: Manifold-Constrained Hyper-Connections
Recently, studies exemplified by Hyper-Connections (HC) have extended the ubiquitous residual connection paradigm established over the past decade by expanding the residual stream width and diversifyi...
![[논문 리뷰] Real Deep Research for AI, Robotics and Beyond](/assets/images/blog/20260101-paper-2510-20809-real-deep-research-for-ai-robo.jpg)
[논문 리뷰] Real Deep Research for AI, Robotics and Beyond
With the rapid growth of research in AI and robotics, now producing over 10,000 papers annually, it has become increasingly difficult for researchers to stay up to date. Fast-evolving trends, the rise o...
![[논문 리뷰] The Free Transformer](/assets/images/blog/20260101-paper-2510-17558-the-free-transformer.jpg)
[논문 리뷰] The Free Transformer
We propose an extension of the decoder Transformer that conditions its generative process on random latent variables which are learned without supervision thanks to a variational procedure. Experiment...
![[논문 리뷰] Training-Free Group Relative Policy Optimization](/assets/images/blog/20260101-paper-2510-08191-training-free-group-relative-p.jpg)
[논문 리뷰] Training-Free Group Relative Policy Optimization
Recent advances in Large Language Model (LLM) agents have demonstrated their promising general capabilities. However, their performance in specialized real-world domains often degrades due to challeng...
![[논문 리뷰] ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory](/assets/images/blog/20260101-paper-2509-25140-reasoningbank-scaling-agent-se.jpg)
[논문 리뷰] ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory
With the growing adoption of large language model agents in persistent real-world roles, they naturally encounter continuous streams of tasks. A key limitation, however, is their failure to learn from...
![[논문 리뷰] Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality](/assets/images/blog/20260101-paper-2405-21060-transformers-are-ssms-generali.jpg)
[논문 리뷰] Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality
While Transformers have been the main architecture behind deep learning's success in language modeling, state-space models (SSMs) such as Mamba have recently been shown to match or outperform Transfor...
![[논문 리뷰] Mamba: Linear-Time Sequence Modeling with Selective State Spaces](/assets/images/blog/20260101-paper-2312-00752-mamba-linear-time-sequence-mod.jpg)
[논문 리뷰] Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Foundation models, now powering most of the exciting applications in deep learning, are almost universally based on the Transformer architecture and its core attention module. Many subquadratic-time a...
![[논문 리뷰] Vector database management systems: Fundamental concepts, use-cases, and current challenges](/assets/images/blog/20260101-paper-2309-11322-vector-database-management-sys.jpg)
[논문 리뷰] Vector database management systems: Fundamental concepts, use-cases, and current challenges
Vector database management systems have emerged as an important component in modern data management, driven by the growing importance for the need to computationally describe rich data such as texts, ...

Improving Model Performance with Data Augmentation
In artificial intelligence (AI) and machine learning (ML), data is the most valuable asset, and securing enough high-quality data is a decisive factor in model performance. In practice, however, data is often scarce, or collecting it takes considerable time and money. Data augmentation techniques have drawn attention as a way to address this...
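To make the idea concrete, here is a minimal, dependency-free sketch of two classic text-augmentation operations — random deletion and random swap. The sentence and deletion rate are illustrative, not taken from the post.

```python
# Two simple text-augmentation operations: each produces a perturbed
# variant of the input that can be added to the training set.

import random


def random_deletion(tokens, p=0.2, rng=None):
    """Drop each token with probability p, keeping at least one token."""
    rng = rng or random.Random()
    kept = [t for t in tokens if rng.random() > p]
    return kept if kept else [rng.choice(tokens)]


def random_swap(tokens, n_swaps=1, rng=None):
    """Swap n_swaps randomly chosen pairs of positions."""
    rng = rng or random.Random()
    out = list(tokens)
    for _ in range(n_swaps):
        i, j = rng.randrange(len(out)), rng.randrange(len(out))
        out[i], out[j] = out[j], out[i]
    return out


rng = random.Random(0)
sentence = "the quick brown fox jumps over the lazy dog".split()
print(random_deletion(sentence, p=0.2, rng=rng))
print(random_swap(sentence, n_swaps=2, rng=rng))
```

Seeding the generator keeps augmented datasets reproducible across runs.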

Building a RAG System: Principles and Implementation of Retrieval-Augmented Generation
As artificial intelligence (AI) advances, natural language processing (NLP) is seeing one innovation after another. One of them is the RAG (Retrieval-Augmented Generation) system. RAG couples two processes, retrieval and generation, to deliver more accurate and richer answers, whereas traditional NLP models rely on large-scale data...
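As a minimal sketch of the retrieval half of RAG, the toy code below scores documents against a query with bag-of-words cosine similarity and splices the top hits into a prompt for a language model. The documents, query, and prompt format are all illustrative.

```python
# Toy retrieval: rank documents by cosine similarity of word counts,
# then build an augmented prompt from the best matches.

import math
from collections import Counter


def bow(text):
    """Bag-of-words counts with naive lowercasing/punctuation stripping."""
    return Counter(text.lower().replace(".", "").split())


def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * \
           math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query, docs, k=2):
    scored = sorted(docs, key=lambda d: cosine(bow(query), bow(d)), reverse=True)
    return [d for d in scored[:k] if cosine(bow(query), bow(d)) > 0]


docs = [
    "RAG combines retrieval with generation.",
    "Tensors are the basic data structure of PyTorch.",
    "Retrieval brings external knowledge into the prompt.",
]
query = "how does retrieval help generation"
context = retrieve(query, docs)
prompt = ("Answer using only the context below.\n\n"
          + "\n".join(context)
          + f"\n\nQuestion: {query}")
print(prompt)
```

Production systems replace the bag-of-words scorer with dense embeddings and a vector index, but the retrieve-then-generate shape is the same.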

PyTorch Tensor Basics
Deep learning sits at the center of modern artificial intelligence (AI), and PyTorch is one of the most popular frameworks among researchers and developers thanks to its flexibility and intuitive interface. To understand PyTorch properly, you first need a solid grasp of its basic unit, the tensor. Tensors hold data...
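A minimal sketch of the basics the post covers, assuming PyTorch is installed: tensor creation, shape and dtype inspection, broadcasting, and a matrix-vector product.

```python
import torch

x = torch.tensor([[1.0, 2.0], [3.0, 4.0]])  # rank-2 tensor (2x2 matrix)
v = torch.tensor([10.0, 20.0])              # rank-1 tensor

print(x.shape, x.dtype)                     # torch.Size([2, 2]) torch.float32
y = x + v                                   # broadcasting: v is added to each row
z = x @ v                                   # matrix-vector product
print(y)
print(z)
```

Broadcasting silently expands `v` to match `x`'s rows — convenient, but worth checking shapes explicitly when debugging.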
![[논문 리뷰] QLoRA: Efficient Finetuning of Quantized LLMs](/assets/images/blog/20251231-paper-2305-14314-qlora-efficient-finetuning-of-.jpg)
[논문 리뷰] QLoRA: Efficient Finetuning of Quantized LLMs
We present QLoRA, an efficient finetuning approach that reduces memory usage enough to finetune a 65B parameter model on a single 48GB GPU while preserving full 16-bit finetuning task performance. QLo...
![[논문 리뷰] BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding](/assets/images/blog/20251231-paper-1810-04805-bert-pre-training-of-deep-bidi.jpg)
[논문 리뷰] BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
We introduce a new language representation model called BERT, which stands for Bidirectional Encoder Representations from Transformers. Unlike recent language representation models, BERT is designed t...

Introduction to MLOps: Building a Machine Learning Operations Pipeline
Machine learning (ML) is driving innovation across many industries today. Developing a successful model, however, is not enough: deploying it to production, monitoring it, and continuously improving it takes dedicated effort. This is where MLOps plays a crucial role...

LLM Fine-Tuning: LoRA and QLoRA
Large language models (LLMs) have driven innovation in natural language processing (NLP). By leveraging vast amounts of data, these models can perform a wide range of linguistic tasks. However, LLMs demand enormous computing resources, and fine-tuning them for specific tasks...
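The core idea behind LoRA fits in a few lines of NumPy: freeze the pretrained weight `W` and learn only a low-rank update `B @ A`, scaled by `alpha / r`. The dimensions below are toy values chosen for illustration; QLoRA additionally stores the frozen `W` in 4-bit quantized form.

```python
import numpy as np

rng = np.random.default_rng(0)
d, r, alpha = 8, 2, 4            # hidden size, LoRA rank, scaling (toy values)

W = rng.normal(size=(d, d))      # frozen pretrained weight (never updated)
A = rng.normal(size=(r, d))      # trainable down-projection, random init
B = np.zeros((d, r))             # trainable up-projection, zero init

# Because B starts at zero, the adapter contributes nothing at initialization,
# so the adapted model behaves exactly like the frozen pretrained model.
W_eff = W + (alpha / r) * (B @ A)
print(np.allclose(W_eff, W))     # True

# LoRA trains only A and B: 2*d*r parameters instead of d*d for full fine-tuning
print(2 * d * r, "adapter params vs", d * d, "full params")
```

At realistic sizes (d in the thousands, r around 8-64) the parameter saving is several orders of magnitude, which is why LoRA makes fine-tuning feasible on modest hardware.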

Diffusion Models: Principles and Applications of Image-Generating AI
Artificial intelligence (AI) has advanced rapidly in recent years, drawing particular attention in image generation. At the center of this progress is a powerful technique called the diffusion model. Diffusion models excel at learning complex patterns and generating strikingly realistic images, and they are being applied across many industries...
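The forward (noising) half of a diffusion model has a closed form, x_t = sqrt(alpha_bar_t) x_0 + sqrt(1 - alpha_bar_t) eps with eps ~ N(0, I); a minimal NumPy sketch, assuming a standard linear beta schedule and a toy 1-D "image":

```python
import numpy as np

rng = np.random.default_rng(0)
T = 1000
betas = np.linspace(1e-4, 0.02, T)      # linear noise schedule (an assumption)
alpha_bar = np.cumprod(1.0 - betas)     # cumulative signal-retention factor

x0 = np.array([1.0, -1.0, 0.5, 0.0])    # clean toy sample

def q_sample(x0, t):
    """Sample x_t directly from x_0 without simulating every step."""
    eps = rng.normal(size=x0.shape)
    return np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1 - alpha_bar[t]) * eps

print(float(alpha_bar[0]))    # near 1: almost no noise at t = 0
print(float(alpha_bar[-1]))   # near 0: essentially pure noise at t = T-1
```

Generation runs this process in reverse: a trained network predicts the noise `eps` at each step and gradually denoises a random sample back into an image.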

Deep Learning Basics
Deep learning is one of the most prominent technologies in artificial intelligence (AI), driving transformative change across many fields. Its role in solving complex problems such as image recognition and natural language processing grows larger every day. In this post, we walk through the basic concepts of deep learning and how to implement them in practice...

Word Embeddings in Natural Language Processing
Word embeddings are among the essential concepts in natural language processing (NLP): a key technique that helps computers understand and process human language. In this blog post, we look at the basic ideas behind word embeddings and the major techniques, and practice them with Python code...
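The central property of word embeddings, that semantically related words get nearby vectors, can be illustrated with hand-crafted toy vectors; real embeddings would come from a trained model such as Word2Vec or GloVe, and the numbers below are made up for the example.

```python
import numpy as np

# Hand-crafted 3-D toy vectors; trained embeddings typically have 100+ dims
emb = {
    "king":  np.array([0.9, 0.8, 0.1]),
    "queen": np.array([0.9, 0.7, 0.2]),
    "apple": np.array([0.1, 0.2, 0.9]),
}

def cosine(u, v):
    """Cosine similarity: 1 for same direction, 0 for orthogonal vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

# Related words should score higher than unrelated ones
print(cosine(emb["king"], emb["queen"]) > cosine(emb["king"], emb["apple"]))  # True
```

This geometric notion of similarity is what downstream models exploit when they consume embeddings instead of raw words.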

Understanding the Transformer Attention Mechanism
Natural language processing (NLP) has undergone revolutionary change in recent years, and at its center stands the Transformer model. Transformers deliver outstanding performance across NLP tasks and are used in applications such as language modeling, translation, summarization, and question answering. The Transformer's...
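The heart of the Transformer is scaled dot-product attention, softmax(QK^T / sqrt(d_k)) V. A minimal NumPy sketch, with toy shapes, a single head, and no masking:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """softmax(Q K^T / sqrt(d_k)) V -- the core Transformer operation."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)          # pairwise query-key similarity
    # numerically stable softmax over the key axis
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V, weights              # weighted mix of values

rng = np.random.default_rng(0)
Q = rng.normal(size=(3, 4))   # 3 query positions, dimension 4
K = rng.normal(size=(3, 4))
V = rng.normal(size=(3, 4))
out, w = scaled_dot_product_attention(Q, K, V)
print(w.sum(axis=-1))  # each row of attention weights sums to 1
```

Each output row is a convex combination of the value vectors, with the weights saying how much each position "attends" to the others; multi-head attention simply runs several of these in parallel.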

A Complete Guide to Reinforcement Learning: From Theory to Practice
Reinforcement learning is a key technology in artificial intelligence (AI) that gives machines the ability to learn and make decisions on their own. It has delivered breakthrough results in fields such as robot control, game playing, and autonomous driving. This guide covers the history of reinforcement learning, its theoretical background, major algorithms, and hands-on implementation in depth.
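As a taste of the algorithms such a guide covers, here is tabular Q-learning on a hypothetical four-state corridor where the agent earns a reward of 1 for reaching the rightmost state; the environment and hyperparameters are invented for illustration.

```python
import numpy as np

n_states, n_actions = 4, 2            # actions: 0 = left, 1 = right
Q = np.zeros((n_states, n_actions))
alpha, gamma = 0.5, 0.9               # learning rate, discount factor

def step(s, a):
    """Toy corridor: reward 1 only on reaching the terminal rightmost state."""
    s2 = max(0, s - 1) if a == 0 else min(n_states - 1, s + 1)
    return s2, (1.0 if s2 == n_states - 1 else 0.0)

rng = np.random.default_rng(0)
for _ in range(500):                  # epsilon-greedy training episodes
    s = 0
    while s != n_states - 1:
        a = rng.integers(n_actions) if rng.random() < 0.2 else int(Q[s].argmax())
        s2, r = step(s, a)
        # Bellman update: move Q(s,a) toward r + gamma * max_a' Q(s', a')
        Q[s, a] += alpha * (r + gamma * Q[s2].max() - Q[s, a])
        s = s2

print(Q.argmax(axis=1)[:3])  # learned greedy policy (1 = move right)
```

After training, the greedy policy moves right from every non-terminal state, which is the optimal behavior in this corridor.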
![[Paper Review] Attention Is All You Need](/assets/images/blog/20251230-paper-1706-03762-attention-is-all-you-need.jpg)
[Paper Review] Attention Is All You Need
The landmark paper that first proposed the Transformer architecture. By discarding RNNs and CNNs entirely and modeling sequences with the attention mechanism alone, it established a new paradigm for sequence modeling and laid the groundwork for modern NLP models such as BERT and GPT.

Graph Neural Networks Basics
Artificial intelligence (AI) and machine learning (ML) have recently been driving innovation across many fields. Among these advances, graph neural networks (GNNs) stand out as a powerful tool for efficiently handling complex structured data. Graph data arises naturally in many real-world problems, including social networks, recommender systems, and molecular structures...

CNN Image Classification: A Core Deep Learning Technique
Image classification is one of the most important problems in computer vision, with applications ranging from obstacle recognition in autonomous vehicles to medical image analysis and image tagging on social media. Convolutional neural networks (CNNs) in particular deliver outstanding performance on image data and stand at the center of the deep learning revolution. In this blog...

A Complete Guide to BERT: The Model That Revolutionized NLP
BERT (Bidirectional Encoder Representations from Transformers) is a groundbreaking natural language processing model released by Google in 2018. This guide covers in detail the architecture, training method, and practical applications of BERT, a model that opened a new paradigm in NLP through bidirectional context understanding.

Welcome to the SuanLab Blog
The SuanLab blog shares stories about data science, artificial intelligence, and deep learning.
