We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.
You must be logged in to block users.
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
rl from zero pretrain, can it be done? yes.
Python 294 21
one page guides that i let my subscribed/customised agents consume to perform actions
Python 362 30
a rubric driven prioritized replay rl algo to maximise continual learning
Python 16 4
implementations and experimentation on mHC by deepseek - https://arxiv.org/abs/2512.24880
Shell 364 34
import os
import sys
import time
import math
import pickle
There was an error while loading. Please reload this page.