Youtu-Tip: Tap for Intelligence, Keep on Device.
Python, 593 stars, 66 forks
MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation
Python, 29 stars, 2 forks
[EMNLP24] Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence
Python, 12 stars
SFT, DPO and Inference scripts for LLM
Python, 6 stars
[ACL25] RoleMRC: A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following
Python, 8 stars, 1 fork
[COLING25] Free-form Instruction-oriented Prompt Optimization with Preference Dataset and Modular Fine-tuning Schema
Python, 5 stars