I'm fine-tuning LLaMA 3.1 8B for multi-turn conversations, using this Colab notebook as a reference (it focuses on single-turn conversations).
Question 1:
Can you confirm whether the data format below is correct for preparing fine-tuning data for multi-turn conversations?
```
<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\nCutting Knowledge Date: December 2023\nToday Date: 26 July 2024\n\n<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\nHow's your asthma since you started using your inhaler again?<|eot_id|><|start_header_id|>user<|end_header_id|>\n\nMuch better. I don't know why I didn't take it with me everywhere I went.<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\nIt's important to carry it with you, especially during times where you're exercising or walking more than usual.<|eot_id|><|start_header_id|>user<|end_header_id|>\n\nYeah. I think I've learned my lesson.<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\nBesides asthma, do you have any other medical problems?<|eot_id|>
```
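For context, this is how I'm producing that string — a minimal sketch assuming `tokenizer` was already loaded via `FastLanguageModel.from_pretrained(...)`; the system preamble (knowledge date / today's date) is inserted by the llama-3.1 template itself:

```python
from unsloth.chat_templates import get_chat_template

# Assumes `tokenizer` comes from FastLanguageModel.from_pretrained(...)
tokenizer = get_chat_template(tokenizer, chat_template="llama-3.1")

messages = [
    {"role": "assistant", "content": "How's your asthma since you started using your inhaler again?"},
    {"role": "user", "content": "Much better. I don't know why I didn't take it with me everywhere I went."},
    {"role": "assistant", "content": "It's important to carry it with you, especially during times where you're exercising or walking more than usual."},
    {"role": "user", "content": "Yeah. I think I've learned my lesson."},
    {"role": "assistant", "content": "Besides asthma, do you have any other medical problems?"},
]

# add_generation_prompt=False for training data: the text should end after
# the final assistant turn, not with an empty assistant header.
text = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=False
)
print(text)
```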
Question 2:
Do I still use the train_on_responses_only method to train only on the assistant outputs and ignore the loss on the user's inputs, given that the conversations are multi-turn?
For example, before masking:

```
<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\nCutting Knowledge Date: December 2023\nToday Date: 26 July 2024\n\n<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\nHow's your asthma since you started using your inhaler again?<|eot_id|><|start_header_id|>user<|end_header_id|>\n\nMuch better. I don't know why I didn't take it with me everywhere I went.<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\nIt's important to carry it with you, especially during times where you're exercising or walking more than usual.<|eot_id|><|start_header_id|>user<|end_header_id|>\n\nYeah. I think I've learned my lesson.<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\nBesides asthma, do you have any other medical problems?<|eot_id|>
```

After masking:

```
 \n\nHow's your asthma since you started using your inhaler again?<|eot_id|> \n\nIt's important to carry it with you, especially during times where you're exercising or walking more than usual.<|eot_id|> \n\nBesides asthma, do you have any other medical problems?<|eot_id|>
```
Question 3:
For inference on a multi-turn conversation, is the following code the correct way to prepare the input data? If not, can you suggest improvements or confirm its correctness?
```python
from unsloth.chat_templates import get_chat_template

# `model` and `tokenizer` are assumed loaded via FastLanguageModel.from_pretrained(...)
tokenizer = get_chat_template(
    tokenizer,
    chat_template="llama-3.1",
)
FastLanguageModel.for_inference(model)

messages = [
    {'content': 'What brings you back into the clinic today, miss?', 'role': 'assistant'},
    {'content': 'I came in for a refill of my blood pressure medicine.', 'role': 'user'},
    {'content': 'It looks like Doctor Kumar followed up with you last time regarding your hypertension, osteoarthritis, osteoporosis, hypothyroidism, allergic rhinitis, and kidney stones. Have you noticed any changes or do you have any concerns regarding these issues?', 'role': 'assistant'},
    {'content': 'No.', 'role': 'user'},
]

inputs = tokenizer.apply_chat_template(
    messages,
    tokenize=True,
    add_generation_prompt=True,  # Required for generation
    return_tensors="pt",
).to("cuda")

outputs = model.generate(
    input_ids=inputs,
    max_new_tokens=64,
    use_cache=True,
    temperature=1.5,
    min_p=0.1,
)
tokenizer.batch_decode(outputs)
```
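One small refinement I've been considering (my own tweak, not from the notebook): `outputs` includes the echoed prompt tokens, so slicing them off before decoding yields only the model's new reply:

```python
# Decode only the newly generated tokens (drop the echoed prompt).
reply = tokenizer.batch_decode(
    outputs[:, inputs.shape[1]:], skip_special_tokens=True
)[0]
print(reply)
```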
Would you kindly validate whether these approaches are appropriate for multi-turn fine-tuning and inference?