大è¯è¨æ¨¡åé·å ¥é误代ç å¾ªç¯ | AIçæåç¿»è¯
Question: LLM-based code assistantsï¼å¦ Claude Code æ Codexï¼æ¯å¦å®¹æé·å ¥éå¤å¤±è´¥å¾ªç¯ï¼ä»è强åéè¯¯æ¹æ³ï¼è䏿¯éå䏿¥å°è¯æ ¹æ¬ä¸åçè·¯å¾ï¼
Answer:
æ¯çï¼ä½ æåºäºä¸ä¸ªé常çå®ä¸å·²å åè®°å½çå±éæ§ãè¿ææ¶å¨ LLM agents çä¸ä¸æä¸è¢«ç§°ä¸º âsolution fixationâ æ âlocal minima trappingâã
为ä»ä¹ä¼åçè¿ç§æ åµ
1. Autoregressive Momentum
LLM æ¯é¡ºåºçæ token çã䏿¦ä¸ä¸æä¸ç¡®ç«äºä¸ä¸ªâæ¹åâï¼æ¨¡åå°±ä¼å¨ç»è®¡ä¸è¢«æåç»§ç»é£ä¸ªæ¹åãæ¯ä¸ªæ° token é½ä¼æ¡ä»¶åä¸ä¸ä¸ªââå æ¤ï¼æ©æçé误å设ä¼åå累积ï¼ä½¿å ¶æ´é¾éè±ã
2. In-Context Reinforcement
模åå¨ä¸ä¸æçªå£ä¸çå°èªå·±ä¹åçå°è¯ã妿å®å°è¯äºä¸æ¬¡ Solution Aï¼ä¸ä¸æç°å¨å æ»¡äº Solution A çæ¨çââè¿è®½åºå°è®©æ¨¡åå¨ä¸ä¸æ¬¡å°è¯æ¶è§å¾ Solution A æ´å âæ£ç¡®âã
3. 没æçæ£çå 认ç¥
人类ä¸å®¶å¨å¡ä½æ¶ï¼å¯ä»¥è·³åºé®é¢å¹¶æèï¼
âççï¼æçè³æ¯å¨è§£å³æ£ç¡®çé®é¢åï¼â
LLM 模æè¿ä¸ç¹ï¼ä½å¹¶æ²¡æçæ£åå°ãå®ä»¬çâåæâæ¬èº«åªæ¯æ´å¤åç¸å缺é·ä¸ä¸æå½±åç token 颿µã
4. é»è®¤æ²¡æå溯
䏿 æç´¢ç®æ³ï¼å¦ MCTSãA*ï¼ä¸åï¼æ åç LLM æ¨çæ¯ä¸ä¸ªååååè¿ç¨ã没æåçæºå¶è¯´âæ¾å¼è¿ä¸ªåæ¯ï¼åå°ååç¹âã
âåºæ§ç人âç±»æ¯å¾åç¡®
ä½ çç±»æ¯å¾çå©ãå®è¡¨ç°å¾åä¸ä¸ªäººï¼
- å ååæèªå·±çè§£é
- ç¨ç»å¾®åä½å°è¯ç¸åçäºæ
- è¶æ·±å ¥è¶èªä¿¡ï¼æ´å¤ token = æ´å¤âæ¿è¯ºâï¼
- 没æå¤é¨å¹²é¢ï¼æ æ³ä»å ¨æ°è§åº¦çå¾ é®é¢
å®é ææççç¥
What Actually Helps
æ´æ·±å±é®é¢ï¼Agentic Loops ä¼è®©æ 嵿´ç³
å¨å Claude Code è¿æ ·çå·¥å ·ä¸ï¼agent å¤äºä¸ä¸ªå¾ªç¯ä¸ââå®è¿è¡ä»£ç ï¼çå°è¾åºï¼åè¯ä¸æ¬¡ã妿åå§æ¹æ³é误ï¼å®å¯è½ä¼è¿ä»£æ°å次毫æ è¿å±ï¼å 为ï¼
- æ¯æ¬¡å¤±è´¥å°è¯é½ä¼åä¸ä¸æä¸æ·»å æ´å¤è¯¥æ¹æ³çâè¯æ®â
- å¥å±ä¿¡å·ï¼éè¯¯æ¶æ¯ï¼éè¿ç¸åç缺é·è§è§è¢«è§£é
- å®ä¼åçæ¯åå°éè¯¯æ¶æ¯ï¼è䏿¯éæ°æèæ¶æ
è¿æ¯ AI agent ç ç©¶ä¸çä¸ä¸ªå·²ç¥å¼æ¾é®é¢ââææ¶ç§°ä¸º âcontext poisoningâ æ âperseverationâã
åºçº¿
ä½ æ¯å¯¹çãå½ä» LLM 卿¬è´¨ä¸æ´æ é¿å©ç¨èéæ¢ç´¢ââå®ä»¬æ é¿å®åè·¯å¾ï¼ä½ä¸æ é¿æ¾å¼å®ã人类éè¦å å½âé大åæâ触åå¨ï¼å¼ºå¶æ¨¡åæ æ³èªè¡å®æçéç½®ã
