YYHDBL (Yihui You) · GitHub
🎯
Focusing

Highlights

  • Pro

YYHDBL/README.md

About

I build agent systems.

Most of what I care about lives around agents: harness engineering, tool use, post-training, RL-flavored data loops, and the stuff that makes models hold up once they leave the demo stage.

I like working on both sides of the problem: shaping model behavior, and building the engineering around it so that behavior actually survives contact with real tasks.

Agents are just models with tools, memory, and consequences.


Now

Currently spending a lot of time on coding agents, agent workflows, trajectory filtering, evaluation harnesses, and fine-tuning pipelines.

Still learning. Still building. Still trying to make agents a little less fake-smart and a little more useful.


Featured Work

An open-source coding agent shaped by harness engineering.

Built around agent loops, tool execution control, prompt cache, context compression, and long-horizon task reliability.

Qwen3-1.7B Rule-Reasoning Post-Training

A small-model post-training project built with Qwen3-1.7B + LoRA.

Focused on staged SFT, answer-format control, CoT parsing, validation, and eval-driven iteration.

A multimodal hybrid RAG system for AIOps knowledge bases.

Built around dense+sparse retrieval, reranking, heterogeneous documents, and practical retrieval quality.



Popular repositories

  1. MyCodeAgent (Public)

    A Claude Code-like agent built for study.

    Python · 238 stars · 50 forks

  2. nlp-agent-notes (Public)

    Public notes on NLP algorithms and agent engineering.

    12 stars

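  3. HelloCodeAgentCli (Public)

    **HelloAgents Code Agent CLI** is an intelligent coding assistant built on the HelloAgents framework. It offers an interactive experience similar to Claude Code/Codex and focuses on safe, intelligent operations on local code repositories.

    Python · 9 stars · 1 fork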

  4. SynergyRestorer (Public)

    Single-Image Desnowing Algorithm Based on a Complementary Mixture-of-Experts and Agreement-Biased Sub-network Routing

    Python

  5. YYHDBL (Public)

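  6. OpsFusion-RAG (Public)

    OpsFusion-RAG is a multimodal hybrid retrieval system for operations knowledge bases. It combines BGE-M3 dense+sparse vectors, Qdrant Hybrid Search, GLM/DeepSeek generation, and Qwen-VL image captioning. For heterogeneous documents such as HTML, XML, and images, it provides path injection, on-demand attachment of image semantics, and RRF + Cross-Encoder reranking, aiming for high-precision, low-hallucination AIOps question answering.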

    Python
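
The OpsFusion-RAG description mentions RRF for combining dense and sparse results. As a rough illustration only, here is a minimal sketch of reciprocal rank fusion in plain Python. It is not taken from the repository: the function name, the document IDs, and the constant `k=60` (a commonly used default) are all assumptions made for the example.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into a single ranking.

    Each document scores 1 / (k + rank) for every list it appears in,
    where rank starts at 1. Documents are returned by descending score,
    with ties broken alphabetically so the output is deterministic.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical result lists from a dense retriever and a sparse retriever.
dense_hits = ["doc_a", "doc_b", "doc_c"]
sparse_hits = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([dense_hits, sparse_hits])
print(fused)
```

Documents that rank well in both lists rise to the top, which is why RRF is a common first step before a heavier cross-encoder reranker.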