Vijayalakshmi

ML enthusiast

HOME
CATEGORIES
TAGS
ARCHIVES
ABOUT

Home Tags Optimization

Tag

Optimization 1

Fine-tune LLM using Reinforcement Learning from Human Feedback (RLHF) May 10, 2024

Recently Updated

Fine-tune LLM using Reinforcement Learning from Human Feedback (RLHF)
Full fine-tuning & PEFT techniques with LLMs
Dialogue summarization using zero shot, one shot and few shot inferences
Faster LLM fine-tuning with Unsloth and using QLORA technique for memory reduction

Trending Tags

LLM PEFT few shot full fine-tuning LoRA LORA one shot Optimization PPO QLORA

© 2024 Vijayalakshmi Manikandan. Some rights reserved.

Using the Chirpy theme for Jekyll.

Trending Tags

LLM PEFT few shot full fine-tuning LoRA LORA one shot Optimization PPO QLORA

A new version of content is available.