No “emergent behavior / aha moment” when retraining GPT-2 on FineWeb; warmup / “warm training” guidance requested · rasbt/LLMs-from-scratch · Discussion #889
Replies: 7 comments 5 replies

1 reply
@rasbt
1 reply
@rasbt
3 replies
@talentJay-ux
@d-kleine
@talentJay-ux
4 participants