QiakaChi's Note
迷茫之中,定有伦理。
[论文笔记] Agentic Reasoning for Large Language Models
论文笔记
Agent
2026-01-30
关于 2025 下半年的小结……
其他
2026-01-03
[论文笔记] Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management
论文笔记
Agent
2026-01-03
[论文笔记] DeepSeek-OCR: Contexts Optical Compression
论文笔记
2025-11-26
[笔记] 0-1背包问题的 DP 状态转移方程与 Bellman 方程转换
其他
2025-11-09
[笔记] 初探Transformer
论文笔记
Transformer
2025-10-22
[论文笔记] WebPilot: A Versatile and Autonomous Multi-Agent System for Web Task Execution with Strategic Exploration
论文笔记
Agent
2025-10-10
[论文笔记] Agent Q Advanced Reasoning and Learning for Autonomous AI Agents (2)
论文笔记
Agent
2025-08-27
[论文笔记] Agent Q Advanced Reasoning and Learning for Autonomous AI Agents (1)
论文笔记
Agent
2025-08-27
[论文笔记] OS Agents-A Survey on MLLM-based Agents for General Computing Devices Use
论文笔记
Agent
2025-08-15
[论文笔记] Efficient Agent Training for Computer Use
论文笔记
Agent
2025-08-06
Hello, Gmeek!
documentation
2025-08-06