Blog
Stories about data science, artificial intelligence, and deep learning
Posts 1-12 of 192
![[Paper Review] RealWonder: Real-Time Physical Action-Conditioned Video Generation](/assets/images/blog/20260308-paper-2603-05449-realwonder-real-time-physical-.jpg)
[Paper Review] RealWonder: Real-Time Physical Action-Conditioned Video Generation
Current video generation models cannot simulate physical consequences of 3D actions like forces and robotic manipulations, as they lack structural understanding of how actions affect 3D scenes. We pre...
![[Paper Review] Helios: Real Real-Time Long Video Generation Model](/assets/images/blog/20260308-paper-2603-04379-helios-real-real-time-long-vid.jpg)
[Paper Review] Helios: Real Real-Time Long Video Generation Model
We introduce Helios, the first 14B video generation model that runs at 19.5 FPS on a single NVIDIA H100 GPU and supports minute-scale generation while matching the quality of a strong baseline. We mak...
![[Paper Review] Phi-4-reasoning-vision-15B Technical Report](/assets/images/blog/20260308-paper-2603-03975-phi-4-reasoning-vision-15b-tec.jpg)
[Paper Review] Phi-4-reasoning-vision-15B Technical Report
We present Phi-4-reasoning-vision-15B, a compact open-weight multimodal reasoning model, and share the motivations, design choices, experiments, and learnings that informed its development. Our goal i...
![[Paper Review] Beyond Language Modeling: An Exploration of Multimodal Pretraining](/assets/images/blog/20260308-paper-2603-03276-beyond-language-modeling-an-ex.jpg)
[Paper Review] Beyond Language Modeling: An Exploration of Multimodal Pretraining
The visual world offers a critical axis for advancing foundation models beyond language. Despite growing interest in this direction, the design space for native multimodal models remains opaque. We pr...
![[Paper Review] Chain of World: World Model Thinking in Latent Motion](/assets/images/blog/20260308-paper-2603-03195-chain-of-world-world-model-thi.jpg)
[Paper Review] Chain of World: World Model Thinking in Latent Motion
Vision-Language-Action (VLA) models are a promising path toward embodied intelligence, yet they often overlook the predictive and temporal-causal structure underlying visual dynamics. World-model VLAs...
![[Paper Review] Understanding LoRA as Knowledge Memory: An Empirical Analysis](/assets/images/blog/20260308-paper-2603-01097-understanding-lora-as-knowledg.jpg)
[Paper Review] Understanding LoRA as Knowledge Memory: An Empirical Analysis
Continuous knowledge updating for pre-trained large language models (LLMs) is increasingly necessary yet remains challenging. Although inference-time methods like In-Context Learning (ICL) and Retriev...
![[Paper Review] EvoSkill: Automated Skill Discovery for Multi-Agent Systems](/assets/images/blog/20260307-paper-2603-02766-evoskill-automated-skill-disco.jpg)
[Paper Review] EvoSkill: Automated Skill Discovery for Multi-Agent Systems
Coding agents are increasingly used as general-purpose problem solvers, but their flexibility does not by itself confer the domain expertise needed for specialized tasks. Recent work addresses this th...
![[Paper Review] Evaluating Theory of Mind and Internal Beliefs in LLM-Based Multi-Agent Systems](/assets/images/blog/20260307-paper-2603-00142-evaluating-theory-of-mind-and-.jpg)
[Paper Review] Evaluating Theory of Mind and Internal Beliefs in LLM-Based Multi-Agent Systems
LLM-based MAS are gaining popularity due to their potential for collaborative problem-solving enhanced by advances in natural language comprehension, reasoning, and planning. Research in Theory of Min...
![[Paper Review] ParamMem: Augmenting Language Agents with Parametric Reflective Memory](/assets/images/blog/20260307-paper-2602-23320-parammem-augmenting-language-a.jpg)
[Paper Review] ParamMem: Augmenting Language Agents with Parametric Reflective Memory
Self-reflection enables language agents to iteratively refine solutions, yet often produces repetitive outputs that limit reasoning performance. Recent studies have attempted to address this limitatio...
![[Paper Review] Large-scale online deanonymization with LLMs](/assets/images/blog/20260307-paper-2602-16800-large-scale-online-deanonymiza.jpg)
[Paper Review] Large-scale online deanonymization with LLMs
We show that large language models can be used to perform at-scale deanonymization. With full Internet access, our agent can re-identify Hacker News users and Anthropic Interviewer participants at hig...
![[Paper Review] From SGD to Spectra: A Theory of Neural Network Weight Dynamics](/assets/images/blog/20260307-paper-2507-12709-from-sgd-to-spectra-a-theory-o.jpg)
[Paper Review] From SGD to Spectra: A Theory of Neural Network Weight Dynamics
Deep neural networks have revolutionized machine learning, yet their training dynamics remain theoretically unclear. We develop a continuous-time, matrix-valued stochastic differential equation (SDE) f...
