Optimization (1)
Fine-tune LLM using Reinforcement Learning from Human Feedback (RLHF)
May 10, 2024