본문으로 건너뛰기

Blog

데이터 과학, 인공지능, 딥러닝에 관한 이야기

186개 중 1-12번째 포스트

[논문 리뷰] EvoSkill: Automated Skill Discovery for Multi-Agent Systems
2026-03-079Paper Review

[논문 리뷰] EvoSkill: Automated Skill Discovery for Multi-Agent Systems

Coding agents are increasingly used as general-purpose problem solvers, but their flexibility does not by itself confer the domain expertise needed for specialized tasks. Recent work addresses this th...

Paper Review
cs.AI
cs.MA
+1
[논문 리뷰] Evaluating Theory of Mind and Internal Beliefs in LLM-Based Multi-Agent Systems
2026-03-0724Paper Review

[논문 리뷰] Evaluating Theory of Mind and Internal Beliefs in LLM-Based Multi-Agent Systems

LLM-based MAS are gaining popularity due to their potential for collaborative problem-solving enhanced by advances in natural language comprehension, reasoning, and planning. Research in Theory of Min...

Paper Review
cs.MA
cs.AI
+1
[논문 리뷰] ParamMem: Augmenting Language Agents with Parametric Reflective Memory
2026-03-0717Paper Review

[논문 리뷰] ParamMem: Augmenting Language Agents with Parametric Reflective Memory

Self-reflection enables language agents to iteratively refine solutions, yet often produces repetitive outputs that limit reasoning performance. Recent studies have attempted to address this limitatio...

Paper Review
cs.LG
cs.MA
+1
[논문 리뷰] Large-scale online deanonymization with LLMs
2026-03-079Paper Review

[논문 리뷰] Large-scale online deanonymization with LLMs

We show that large language models can be used to perform at-scale deanonymization. With full Internet access, our agent can re-identify Hacker News users and Anthropic Interviewer participants at hig...

Paper Review
cs.CR
cs.AI
+1
[논문 리뷰] From SGD to Spectra: A Theory of Neural Network Weight Dynamics
2026-03-0710Paper Review

[논문 리뷰] From SGD to Spectra: A Theory of Neural Network Weight Dynamics

Deep neural networks have revolutionized machine learning, yet their training dynamics remain theoretically unclear-we develop a continuous-time, matrix-valued stochastic differential equation (SDE) f...

Paper Review
cs.LG
cs.LG
[논문 리뷰] ParamMem: Augmenting Language Agents with Parametric Reflective Memory
2026-03-058Paper Review

[논문 리뷰] ParamMem: Augmenting Language Agents with Parametric Reflective Memory

Self-reflection enables language agents to iteratively refine solutions, yet often produces repetitive outputs that limit reasoning performance. Recent studies have attempted to address this limitatio...

Paper Review
cs.LG
cs.MA
+1
[논문 리뷰] Stress Testing Deliberative Alignment for Anti-Scheming Training
2026-03-059Paper Review

[논문 리뷰] Stress Testing Deliberative Alignment for Anti-Scheming Training

Highly capable AI systems could secretly pursue misaligned goals -- what we call "scheming". Because a scheming AI would deliberately try to hide its misaligned goals and actions, measuring and mitiga...

Paper Review
cs.AI
cs.AI
[논문 리뷰] Vectorizing the Trie: Efficient Constrained Decoding for LLM-based Generative Retrieval on Accelerators
2026-03-019Paper Review

[논문 리뷰] Vectorizing the Trie: Efficient Constrained Decoding for LLM-based Generative Retrieval on Accelerators

Generative retrieval has emerged as a powerful paradigm for LLM-based recommendation. However, industrial recommender systems often benefit from restricting the output space to a constrained subset of...

Paper Review
cs.IR
cs.CL
+1
[논문 리뷰] A Very Big Video Reasoning Suite
2026-03-018Paper Review

[논문 리뷰] A Very Big Video Reasoning Suite

Rapid progress in video models has largely focused on visual quality, leaving their reasoning capabilities underexplored. Video reasoning grounds intelligence in spatiotemporally consistent visual env...

Paper Review
cs.CV
cs.AI
+1
[논문 리뷰] From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models
2026-02-289Paper Review

[논문 리뷰] From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

As Large Multimodal Models (LMMs) scale up and reinforcement learning (RL) methods mature, LMMs have made notable progress in complex reasoning and decision making. Yet training still relies on static...

Paper Review
cs.CV
cs.CV
[논문 리뷰] MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks
2026-02-289Paper Review

[논문 리뷰] MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks

Despite the remarkable progress of large language models (LLMs), the capabilities of standalone LLMs have begun to plateau when tackling real-world, complex tasks that require interaction with externa...

Paper Review
cs.AI
cs.AI
[논문 리뷰] Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization
2026-02-2810Paper Review

[논문 리뷰] Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization

Recent deep research agents primarily improve performance by scaling reasoning depth, but this leads to high inference cost and latency in search-intensive scenarios. Moreover, generalization across h...

Paper Review
cs.CL
cs.CL
...