SuanLab

Blog

Posts 1-12 of 210

[Paper Review] Intuitive physics understanding emerges from self-supervised pretraining on natural videos
2026-04-27 | 9 | Paper Review

We investigate the emergence of intuitive physics understanding in general-purpose deep neural network models trained to predict masked regions in natural videos. Leveraging the violation-of-expectati...

Paper Review
cs.CV
cs.AI
+1
[Paper Review] Intuitive physics understanding emerges from self-supervised pretraining on natural videos
2026-04-05 | 8 | Paper Review

We investigate the emergence of intuitive physics understanding in general-purpose deep neural network models trained to predict masked regions in natural videos. Leveraging the violation-of-expectati...

Paper Review
cs.CV
cs.AI
+1
[Paper Review] Internalizing Agency from Reflective Experience
2026-03-19 | 17 | Paper Review

Large language models are increasingly deployed as autonomous agents that must plan, act, and recover from mistakes through long-horizon interaction with environments that provide rich feedback. Howev...

Paper Review
cs.AI
[Paper Review] Language Models are Injective and Hence Invertible
2026-03-19 | 8 | Paper Review

Transformer components such as non-linear activations and normalization are inherently non-injective, suggesting that different inputs could map to the same output and prevent exact recovery of the in...

Paper Review
cs.LG
cs.AI
+1
[Paper Review] Attention Residuals
2026-03-18 | 10 | Paper Review

Residual connections with PreNorm are standard in modern LLMs, yet they accumulate all layer outputs with fixed unit weights. This uniform aggregation causes uncontrolled hidden-state growth with dept...

Paper Review
cs.CL
[Paper Review] Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration
2026-03-17 | 10 | Paper Review

Despite interdisciplinary research leading to larger and longer-term impact, most work remains confined to single-domain academic silos. Recent AI-based approaches to scientific discovery show promise...

Paper Review
cs.CL
cs.AI
+1
[Paper Review] Nurture-First Agent Development: Building Domain-Expert AI Agents Through Conversational Knowledge Crystallization
2026-03-16 | 9 | Paper Review

The emergence of large language model (LLM)-based agent frameworks has shifted the primary challenge in building domain-expert AI agents from raw capability to effective encoding of domain expertise. ...

Paper Review
cs.AI
cs.HC
+1
[Paper Review] Reinforced Generation of Combinatorial Structures: Ramsey Numbers
2026-03-16 | 9 | Paper Review

We present improved lower bounds for five classical Ramsey numbers: $\mathbf{R}(3, 13)$ is increased from $60$ to $61$, $\mathbf{R}(3, 18)$ from $99$ to $100$, $\mathbf{R}(4, 13)$ from $138$ to $139$,...

Paper Review
math.CO
cs.AI
+1
[Paper Review] LLM2Vec-Gen: Generative Embeddings from Large Language Models
2026-03-15 | 10 | Paper Review

LLM-based text embedders typically encode the semantic content of their input. However, embedding tasks require mapping diverse inputs to similar outputs. Typically, this input-output is addressed by ...

Paper Review
cs.CL
[Paper Review] OpenClaw-RL: Train Any Agent Simply by Talking
2026-03-15 | 9 | Paper Review

Every agent interaction generates a next-state signal, namely the user reply, tool output, terminal or GUI state change that follows each action, yet no existing agentic RL system recovers it as a liv...

Paper Review
cs.CL
[Paper Review] LLM2Vec-Gen: Generative Embeddings from Large Language Models
2026-03-14 | 8 | Paper Review

LLM-based text embedders typically encode the semantic content of their input. However, embedding tasks require mapping diverse inputs to similar outputs. Typically, this input-output is addressed by ...

Paper Review
cs.CL
[Paper Review] Nurture-First Agent Development: Building Domain-Expert AI Agents Through Conversational Knowledge Crystallization
2026-03-12 | 8 | Paper Review

The emergence of large language model (LLM)-based agent frameworks has shifted the primary challenge in building domain-expert AI agents from raw capability to effective encoding of domain expertise. ...

Paper Review
cs.AI
cs.HC
+1