Nvidia Just Published ChatQA 1.5, a Llama3 QA/RAG Finetune

by noiseinvacuum on 5/2/24, 6:01 PM with 4 comments

Looks like quite a solid fine-tune going by the benchmark results. I wish they had added a benchmark comparison to GPT-4 Turbo as well.

From the huggingface page:

Model Details

We introduce Llama3-ChatQA-1.5, which excels at conversational question answering (QA) and retrieval-augmented generation (RAG). Llama3-ChatQA-1.5 is built using the training recipe from ChatQA (1.0), and it is built on top of the Llama-3 base model. Additionally, we incorporate more conversational QA data to enhance its tabular and arithmetic calculation capability. Llama3-ChatQA-1.5 has two variants: Llama3-ChatQA-1.5-8B and Llama3-ChatQA-1.5-70B. Both models were originally trained using Megatron-LM; we converted the checkpoints to Hugging Face format.

Other Resources