본문으로 건너뛰기
SuanLab

Blog

241개 중 1-12번째 포스트

[논문 리뷰] LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling
2026-05-149Paper Review

[논문 리뷰] LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Test-time scaling (TTS) has become an effective approach for improving large language model performance by allocating additional computation during inference. However, existing TTS strategies are larg...

Paper Review
cs.CL
cs.CL
[논문 리뷰] Complex-Valued Phase-Coherent Transformer
2026-05-138Paper Review

[논문 리뷰] Complex-Valued Phase-Coherent Transformer

Complex-valued Transformers have largely inherited softmax attention from real-valued architectures. However, row-normalised token competition is not necessarily aligned with phase-preserving computat...

Paper Review
cs.LG
cs.LG
[논문 리뷰] Fast Byte Latent Transformer
2026-05-137Paper Review

[논문 리뷰] Fast Byte Latent Transformer

Recent byte-level language models (LMs) match the performance of token-level models without relying on subword vocabularies, yet their utility is limited by slow, byte-by-byte autoregressive generatio...

Paper Review
cs.CL
cs.AI
+1
[논문 리뷰] Temporal Straightening for Latent Planning
2026-05-118Paper Review

[논문 리뷰] Temporal Straightening for Latent Planning

Learning good representations is essential for latent planning with world models. While pretrained visual encoders produce strong semantic visual features, they are not tailored to planning and contai...

Paper Review
cs.LG
cs.LG
자연어 처리를 위한 새로운 혁명: 대규모 언어 모델의 등장
2026-05-096General

자연어 처리를 위한 새로운 혁명: 대규모 언어 모델의 등장

최근 몇 년간 인공지능(AI) 분야에서 가장 주목할 만한 발전은 단연 대규모 언어 모델(Large Language Model, LLM)의 부상입니다. OpenAI의 GPT-4, Google의 Gemini, Meta의 Llama 3와 같은 모델들은 자연어 처리(Natural Language Processing, NLP) 분야에 혁신을 가져왔습니다. 이러한 모...

[논문 리뷰] A Theory of Generalization in Deep Learning
2026-05-0914Paper Review

[논문 리뷰] A Theory of Generalization in Deep Learning

We present a non-asymptotic theory of generalization in deep learning where the empirical neural tangent kernel partitions the output space. In directions corresponding to signal, error dissipates rap...

Paper Review
cs.LG
stat.ML
+1
[논문 리뷰] Temporal Straightening for Latent Planning
2026-05-098Paper Review

[논문 리뷰] Temporal Straightening for Latent Planning

Learning good representations is essential for latent planning with world models. While pretrained visual encoders produce strong semantic visual features, they are not tailored to planning and contai...

Paper Review
cs.LG
cs.LG
[논문 리뷰] From Context to Skills: Can Language Models Learn from Context Skillfully?
2026-05-077Paper Review

[논문 리뷰] From Context to Skills: Can Language Models Learn from Context Skillfully?

Many real-world tasks require language models (LMs) to reason over complex contexts that exceed their parametric knowledge. This calls for context learning, where LMs directly learn relevant knowledge...

Paper Review
cs.AI
cs.AI
[논문 리뷰] Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models
2026-05-069Paper Review

[논문 리뷰] Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models

We propose a theoretical model called "information gravity" to describe the text generation process in large language models (LLMs). The model uses physical apparatus from field theory and spacetime g...

Paper Review
cs.CL
cs.CL
자연어 처리의 미래를 여는 열쇠: 휴먼-인-더-루프(Human-in-the-Loop)
2026-05-056General

자연어 처리의 미래를 여는 열쇠: 휴먼-인-더-루프(Human-in-the-Loop)

자연언어처리(Natural Language Processing, NLP)는 인공지능(AI) 분야에서 가장 빠르게 발전하는 영역 중 하나입니다. 특히 거대 언어 모델(Large Language Models, LLM)의 등장은 인간의 언어를 이해하고 생성하는 AI의 능력을 전례 없는 수준으로 끌어올렸습니다. 하지만 이러한 눈부신 발전에도 불구하고, AI는 여전...

[논문 리뷰] The Last Human-Written Paper: Agent-Native Research Artifacts
2026-05-049Paper Review

[논문 리뷰] The Last Human-Written Paper: Agent-Native Research Artifacts

Scientific publication compresses a branching, iterative research process into a linear narrative, discarding the majority of what was discovered along the way. This compilation imposes two structural...

Paper Review
cs.LG
cs.LG
[논문 리뷰] The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook
2026-05-0418Paper Review

[논문 리뷰] The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

Latent space is rapidly emerging as a native substrate for language-based models. While modern systems are still commonly understood through explicit token-level generation, an increasing body of work...

Paper Review
cs.AI
cs.AI
...