You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Fine-tune the recent Llama-2-7b model on a single GPU and turn it into a chatbot I will leverage PEFT library from Hugging Face ecosystem, as well as QLoRA for more memory efficient finetuning.