Paper Reading ๐Ÿ“œ/Natural Language Processing

Vicuna๐Ÿช: An Open-Source Chatbot Impressing GPT-4 ๋ฆฌ๋ทฐ

Cartinoe 2023. 6. 17. 14:21

The overview of 'Vicuna'

 Vicuna 13B๋Š” ShareGPT๋กœ๋ถ€ํ„ฐ ์ˆ˜์ง‘๋œ user-shared ๋Œ€ํ™”์—์„œ fine-tuned LLaMA์—์„œ ํ•™์Šต๋œ open-source ์ฑ—๋ด‡์ด๋‹ค. GPT-4๋ฅผ ํ‰๊ฐ€์ž๋กœ ์‚ฌ์šฉํ•œ ์‚ฌ์ „ ํ‰๊ฐ€๋Š” Vicuna-13B๊ฐ€ OpenAI ChatGPT์™€ Google Bard์˜ 90%์— ํ•ด๋‹นํ•˜๋Š” ํ€„๋ฆฌํ‹ฐ๋ฅผ ๋‹ฌ์„ฑํ•˜๋Š” ๋ฐ˜๋ฉด LLaMA์™€ Alpaca๋ณด๋‹ค 90%์˜ ๊ฒฝ์šฐ์— ๋” ๋‚˜์€ ๋ชจ์Šต์„ ๋ณด์—ฌ์คฌ๋‹ค. Vicuna-13B์˜ ํ•™์Šต ๋น„์šฉ์€ 300$ ์ •๋„์ด๋‹ค. ๊ทธ๋ฆฌ๊ณ  Vicuna์˜ ์ฝ”๋“œ์™€ ๊ฐ€์ค‘์น˜๋Š” ๋น„์ƒ์—…์  ์‚ฌ์šฉ์— ํ•œํ•ด์„œ ๊ณต๊ฐœ๋˜์—ˆ๋‹ค.

 

Vicuna

 

How Good Is Vicuna?

 

 70K user-shared ChatGPT ๋Œ€ํ™”๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ Vicuna๋ฅผ fine-tuning ํ•œ ํ›„์—, Vicuna๋Š” Alpaca์™€ ๋น„๊ตํ•ด์„œ ๋”์šฑ ๋””ํ…Œ์ผํ•˜๊ณ  ์ž˜ ์งœ์ธ ์‘๋‹ต์„ ๋งŒ๋“ค ์ˆ˜ ์žˆ์–ด์กŒ๋‹ค.

 

VIcuna Evaluation

 

 ํ•˜์ง€๋งŒ ์ฑ—๋ด‡์„ ํ‰๊ฐ€ํ•˜๋Š” ๊ฒƒ์€ ๊ฐ„๋‹จํ•œ task๊ฐ€ ์•„๋‹ˆ๋‹ค. ์ตœ๊ทผ์˜ GPT-4์˜ ๋ฐœ์ „๊ณผ ํ•จ๊ป˜ GPT-4์˜ ๋Šฅ๋ ฅ์ด ๋ฒค์น˜๋งˆํฌ ์ƒ์„ฑ๊ณผ ์„ฑ๋Šฅ ํ‰๊ฐ€๋ฅผ ์œ„ํ•œ ๋‹ค๋™ํ™”๋œ ํ‰๊ฐ€๋ฅผ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•ด์ฃผ๋Š” human-like level์— ๋„๋‹ฌํ•  ์ˆ˜ ์žˆ๋Š”์ง€ ์—†๋Š”์ง€ ๊ถ๊ธˆํ•˜์˜€๋‹ค. ์‹คํ—˜์„ ํ†ตํ•ด ๋ฐœ๊ฒฌํ•œ ์ ์€ GPT-4๊ฐ€ ์ฑ—๋ด‡ ๋Œ€๋‹ต๊ณผ ๋น„๊ตํ•  ๋•Œ ๊ฝค ์ผ๊ด€๋œ ๋žญํฌ์™€ ๋””ํ…Œ์ผํ•œ ํ‰๊ฐ€๋ฅผ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ๋‹ค๋Š” ๊ฒƒ์„ ๊ฐ€๋ฆฌํ‚จ๋‹ค. GPT-4์˜ ํ‰๊ฐ€์— ๊ธฐ๋ฐ˜ํ•ด์„œ ์š”์•ฝ๋œ ๊ทธ๋ฆผ 1์—์„œ๋Š” Vicuna๊ฐ€ Bard/ChatGPT์˜ 90%์— ๋‹ฌํ•˜๋Š” ๋Šฅ๋ ฅ์„ ๋‹ฌ์„ฑํ•œ๋‹ค๋Š” ๊ฒƒ์„ ๋ฐœ๊ฒฌํ•˜์˜€๋‹ค. ์ด ์ œ์•ˆ๋œ ํ”„๋ ˆ์ž„์›Œํฌ๋Š” ์ฑ—๋ด‡ ํ‰๊ฐ€๋ฅผ ์ž๋™ํ™”ํ•˜๊ธฐ ์œ„ํ•œ ์ž ์žฌ๋ ฅ์„ ๋ณด์—ฌ์ฃผ์ง€๋งŒ, ์•„์ง ์™„๋ฒฝํ•œ ์ ‘๊ทผ๋ฒ•์€ ์•„๋‹ˆ๋‹ค. ์ฑ—๋ด‡์„ ์œ„ํ•œ ํ‰๊ฐ€ ์‹œ์Šคํ…œ์„ ๋งŒ๋“œ๋Š” ๊ฒƒ์€ ์•„์ง open question์ด์–ด์„œ ํ–ฅํ›„ ์—ฐ๊ตฌ๋ฅผ ํ•„์š”๋กœ ํ•œ๋‹ค.

 

๊ทธ๋ฆผ 1. GPT-4์— ์˜ํ•ด ํ‰๊ฐ€๋œ ๋น„๊ต ์‘๋‹ต ํ€„๋ฆฌํ‹ฐ

 

Overview

 

 ์ตœ๊ทผ์— LLM์€ ์—„์ฒญ๋‚œ ๋ฐœ์ „์„ ๊ฑฐ๋“ญํ•˜๋ฉฐ ๋†€๋ผ์šด ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์ฃผ๊ณ  ์žˆ์ง€๋งŒ, ๋Œ€๋ถ€๋ถ„์˜ ๋ชจ๋ธ์€ ํ•™์Šต ๋ฐฉ๋ฒ•๊ณผ ๊ตฌ์ฒด์ ์ธ architecture๊ฐ€ ๊ณต๊ฐœ๋˜์ง€ ์•Š์•„ ์ด ๋ถ„์•ผ์˜ ์—ฐ๊ตฌ์™€ open-sourceํ™”๋ฅผ ๋ฐฉํ•ดํ•˜๊ณ  ์žˆ๋‹ค. Meta์˜ LLaMA์™€ Stanford์˜ Alpaca ํ”„๋กœ์ ํŠธ์— ์˜๊ฐ์„ ๋ฐ›์•„์„œ ๊ฐœ์„ ๋œ ๋ฐ์ดํ„ฐ์…‹์œผ๋กœ ์ด๋ฃจ์–ด์ง€๊ณ , ์‰ฝ๊ฒŒ ์‚ฌ์šฉ๊ฐ€๋Šฅํ•˜๊ณ , scalable ํ•œ infrastructure๋ฅผ ๊ฐ–๋Š” open-source ์ฑ—๋ด‡์ธ Vicuna-13B๋ฅผ ์†Œ๊ฐœํ•˜์˜€๋‹ค. ShareGPT.com์œผ๋กœ๋ถ€ํ„ฐ ์ˆ˜์ง‘๋œ used-shared ๋Œ€ํ™”์—์„œ LLaMA base model์„ fine-tuning ํ•จ์œผ๋กœ์จ Vicuna-13B๋Š” Stanford Alpaca์™€ ๊ฐ™์€ open-source ๋ชจ๋ธ๊ณผ ๋น„๊ตํ•ด์„œ ๊ฒฌ์ค„ ๋งŒํ•œ ์„ฑ๋Šฅ์„ ๊ฐ–๊ฒŒ ๋˜์—ˆ๋‹ค.

 

๊ทธ๋ฆผ 2. Workflow ๊ฐœ์š”

 

 ๊ทธ๋ฆผ 2๋Š” Vicuna๊ฐ€ ๋งŒ๋“ค์–ด์ง„ ๊ฐœ์š”๋ฅผ ๋ณด์—ฌ์ค€๋‹ค. ์‹œ์ž‘์— ์•ž์„œ, ShareGPT.com์œผ๋กœ๋ถ€ํ„ฐ 70K ๊ฐœ์˜ ๋Œ€ํ™” ๋ฐ์ดํ„ฐ๋ฅผ ์ˆ˜์ง‘ํ•˜์˜€๋‹ค. ๊ทธ ๋‹ค์Œ์— multi-round ๋Œ€ํ™”์™€ long sequence๋ฅผ ๋”์šฑ ์ž˜ ๋‹ค๋ฃจ๊ธฐ ์œ„ํ•ด Alpaca์— ์˜ํ•ด ์ œ๊ณต๋˜๋Š” training script๋ฅผ ํ–ฅ์ƒ์‹œ์ผฐ๋‹ค. ๊ทธ๋ฆฌ๊ณ  80๊ฐœ์˜ ๋‹ค์–‘ํ•œ question set๋ฅผ ์ƒ์„ฑํ•˜๊ณ  model output์„ ํ‰๊ฐ€ํ•˜๊ธฐ ์œ„ํ•ด GPT-4๋ฅผ ํ™œ์šฉํ•จ์œผ๋กœ์จ ๋ชจ๋ธ ํ€„๋ฆฌํ‹ฐ์˜ ํ‰๊ฐ€๋ฅผ ์ˆ˜ํ–‰ํ•˜์˜€๋‹ค. 2๊ฐœ์˜ ์„œ๋กœ ๋‹ค๋ฅธ ๋ชจ๋ธ์„ ๋น„๊ตํ•˜๊ธฐ ์œ„ํ•ด ๊ฐ ๋ชจ๋ธ์˜ output์„ ๊ฐ question์— ๋Œ€ํ•œ ํ•˜๋‚˜์˜ prompt๋กœ ๋ฌถ์—ˆ๋‹ค. ๊ทธ๋‹ค์Œ์— ์ด prompt๋Š” GPT-4์— ๋ณด๋‚ด์ ธ์„œ ์–ด๋–ค ๋ชจ๋ธ์˜ ์‘๋‹ต์ด ๋” ๋‚˜์€์ง€ ํ‰๊ฐ€๋œ๋‹ค. LLaMA, Alpaca, ChatGPT, Vicuna์— ๋Œ€ํ•œ ๋””ํ…Œ์ผ์ด ํ‘œ 1์— ๋‚˜ํƒ€๋‚˜ ์žˆ๋‹ค.

 

ํ‘œ 1. ์—ฌ๋Ÿฌ ๋ชจ๋ธ ๊ฐ„์˜ ๋น„๊ต

 

Training

 

 Vicuna๋Š” public API์™€ ShareGPT.com์œผ๋กœ๋ถ€ํ„ฐ ์ˆ˜์ง‘๋œ ๊ฑฐ์˜ 70K ๊ฐœ์˜ user-shared ๋Œ€ํ™” ๋ฐ์ดํ„ฐ๋ฅผ ์‚ฌ์šฉํ•ด์„œ LLaMA base model์„ fine-tune ํ•จ์œผ๋กœ์จ ์ƒ์„ฑ๋˜์—ˆ๋‹ค. ๋ฐ์ดํ„ฐ ํ€„๋ฆฌํ‹ฐ๋ฅผ ๋ณด์žฅํ•˜๊ธฐ ์œ„ํ•ด HTML์„ ๋งˆํฌ๋‹ค์šด์œผ๋กœ ๋‹ค์‹œ ๋ณ€ํ™˜ํ•˜๊ณ  ๋ถ€์ ์ ˆํ•˜๊ฑฐ๋‚˜ low-quality ์ƒ˜ํ”Œ์„ ํ•„ํ„ฐ๋งํ•ด๋‚ธ๋‹ค. ์ถ”๊ฐ€์ ์œผ๋กœ ๊ธธ์ด๊ฐ€ ์žˆ๋Š” ๋Œ€ํ™”๋ฅผ ๋ชจ๋ธ์˜ ์ตœ๋Œ€ context length๋ฅผ ๋งŒ์กฑํ•˜๋Š” smaller segment๋กœ ๋‚˜๋ˆด๋‹ค.

 

 Vicuna์˜ training recipe๋Š” Stanford Alapca์˜ ์œ„์— ๋‹ค์Œ์˜ ๊ฐœ์„ ์ ์„ ์ถ”๊ฐ€ํ•˜์˜€๋‹ค.

 

  • Memory Optimizations: Vicuna์˜ long context ์ดํ•ด๋ฅผ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•˜๊ธฐ ์œ„ํ•ด max context length๋ฅผ Alpaca์—์„œ ์‚ฌ์šฉํ•œ 512์—์„œ 2048๋กœ ๋Š˜๋ ธ๋‹ค. ์ด๊ฒƒ์€ GPU์˜ ํ•„์š”๋ฅผ ์ƒ๋‹นํžˆ ์ฆ๊ฐ€์‹œํ‚ค๋Š”๋ฐ ์ด๋Š” gradient checkpointing๊ณผ flash attention์„ ์‚ฌ์šฉํ•จ์œผ๋กœ์จ ํ•ด๊ฒฐํ•˜์˜€๋‹ค.
  • Multi-round Conversation: multi-round ๋Œ€ํ™”๋ฅผ ์„ค๋ช…ํ•˜๊ณ  ์ฑ—๋ด‡์˜ output์—์„œ fine-tuning์˜ loss๋ฅผ ๊ณ„์‚ฐํ•˜๊ธฐ ์œ„ํ•ด training loss๋ฅผ ์กฐ์ •ํ•˜์˜€๋‹ค.
  • Cost Reduction via Spot Instance: training์„ ์œ„ํ•œ 40๋ฐฐ ๋” ํฐ ๋ฐ์ดํ„ฐ์…‹๊ณผ 4๋ฐฐ ๋” ๊ธด ์‹œํ€€์Šค์˜ ๊ธธ์ด๋Š” ํ•™์Šต ๋น„์šฉ ์ธก๋ฉด์—์„œ ์ƒ๋‹นํ•œ ์–ด๋ ค์›€์„ ํ‘œ์ถœํ•œ๋‹ค. ๊ทธ๋ฆฌ๊ณ  ์„ ์ทจ๊ถŒ์„ ์œ„ํ•œ auto-recovery๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๋”์šฑ ์‹ผ spot instance์™€ aito zone switch๋ฅผ ํ™œ์šฉํ•จ์œผ๋กœ์จ cost๋ฅผ ์ค„์ด๊ธฐ ์œ„ํ•œ SkyPilot managed spot์„ ์‚ฌ์šฉํ•˜์˜€๋‹ค.

 

How To Evaluate a Chatbot?

 

 AI ์ฑ—๋ด‡์„ ํ‰๊ฐ€ํ•˜๋Š” task๋Š” ์ƒ๋‹นํžˆ ์–ด๋ ค์šด๋ฐ, ์–ธ์–ด ์ดํ•ด์™€ ์ถ”๋ก , ๋ฌธ๋งฅ ์ดํ•ด๋ฅผ ์š”๊ตฌํ•˜๊ธฐ ๋•Œ๋ฌธ์ด๋‹ค. AI ์ฑ—๋ด‡์ด ๋”์šฑ ๋ฐœ์ „๋จ์— ๋”ฐ๋ผ ํ˜„์žฌ์˜ open ๋ฒค์น˜๋งˆํฌ๋Š” ๋” ์ด์ƒ ์ถฉ๋ถ„ํ•˜์ง€ ์•Š์„ ์ˆ˜๋„ ์žˆ๋‹ค. ์ด๋Ÿฌํ•œ ๋ฌธ์ œ์ ์„ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ์ฑ—๋ด‡ ์„ฑ๋Šฅ ํ‰๊ฐ€๋ฅผ ์ž๋™ํ™”ํ•˜๊ธฐ ์œ„ํ•ด GPT-4์— ๊ธฐ๋ฐ˜์„ ๋‘” ํ‰๊ฐ€ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•˜์˜€๋‹ค.

 

 ์ฒซ ๋ฒˆ์งธ๋กœ ์ฑ—๋ด‡ ์„ฑ๋Šฅ์˜ ๋‹ค์–‘ํ•œ ์ธก๋ฉด์„ ๋ฐ์ŠคํŠธํ•˜๊ธฐ ์œ„ํ•œ 8๊ฐœ์˜ question ์นดํ…Œ๊ณ ๋ฆฌ๋ฅผ ๊ณ ์•ˆํ•˜์˜€๋‹ค. ์‹ ์ค‘ํ•œ prompt engineering์„ ํ†ตํ•ด GPT-4๋Š” baseline model์ด ์–ด๋ ค์›€์„ ๊ฒช๋Š” ๋‹ค์–‘ํ•˜๊ณ  ์–ด๋ ค์šด question์„ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ์—ˆ๋‹ค. ๊ฐ ์นดํ…Œ๊ณ ๋ฆฌ ๋‹น 10๊ฐœ์˜ question์„ ์„ ํƒํ•˜๊ณ  5๊ฐœ์˜ ์ฑ—๋ด‡(LLaMA, Alapca, ChatGPT, Bard, Vicuna)์œผ๋กœ๋ถ€ํ„ฐ ์‘๋‹ต์„ ์ˆ˜์ง‘ํ•˜์˜€๋‹ค. ๊ทธ๋‹ค์Œ์— GPT-4์—๊ฒŒ ๋ฌผ์–ด๋ด์„œ ์ด๋“ค์˜ ์‘๋‹ต์„ helpfulness, relevance, accuracy, detail์— ๊ธฐ๋ฐ˜ํ•ด์„œ ํ€„๋ฆฌํ‹ฐ๋ฅผ ํ‰๊ฐ€ํ•˜์˜€๋‹ค. GPT-4๋Š” ๋น„๊ต์  ์ผ๊ด€์ ์ธ score๋ฅผ ์‚ฐ์ถœํ•  ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ ์™œ ์ด๋Ÿฐ score๊ฐ€ ์ฃผ์–ด์กŒ๋Š”์ง€์— ๋Œ€ํ•œ ๋””ํ…Œ์ผํ•œ ์„ค๋ช…๋„ ์ œ๊ณตํ•ด ์ค€๋‹ค. ํ•˜์ง€๋งŒ, GPT-4๋„ ์ฝ”๋”ฉ/์ˆ˜ํ•™ task๋ฅผ ํ‰๊ฐ€ํ•˜๋Š” ๋ฐ๋Š” ๋งค์šฐ ์ข‹์ง€ ์•Š์•˜๋‹ค.

 

๊ทธ๋ฆผ 3. GPT-4์— ์˜ํ•ด ํ‰๊ฐ€๋œ ์‘๋‹ต์˜ ๋น„๊ต

 

 ๊ทธ๋ฆผ 3์€ ๋ชจ๋“  baseline๊ณผ Vicuna ๊ฐ„์˜ ๋น„๊ต ๊ฒฐ๊ณผ๋ฅผ ๋ณด์—ฌ์ค€๋‹ค. GPT-4๋Š” Vicuna๋ฅผ ๊ธฐ์กด SoTA open-source ๋ชจ๋ธ(LLaMA, Alapca)๋ณด๋‹ค 90% ์ด์ƒ์˜ question์—์„œ ๋” ์„ ํ˜ธํ•˜์˜€๊ณ , ์ƒ์—…์šฉ ๋ชจ๋ธ๊ณผ๋„ ๊ฒฌ์ค„ ๋งŒํ•œ ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ•˜์˜€๋‹ค. GPT-4๋Š” ํ€„๋ฆฌํ‹ฐ score๋ฅผ ์ด 10์ ์œผ๋กœ ํ•ด์„œ ํ‰๊ฐ€ํ•˜๊ธฐ ๋•Œ๋ฌธ์—, ๊ฐ ๋น„๊ต ์Œ(baseline, Vicuna)๋ฅผ 80๊ฐœ์˜ question์—์„œ ๊ฐ ๋ชจ๋ธ์— ์˜ํ•ด ์–ป์–ด์ง„ score๋ฅผ ์ข…ํ•ฉํ•ด์„œ total score๋ฅผ ๋น„๊ตํ•˜์˜€๋‹ค. ํ‘œ 2์—์„œ ๋ณด์ด๋Š” ๊ฒƒ์ฒ˜๋Ÿผ Vicuna์˜ total score๋Š” ChatGPT์˜ 92%์ด๋‹ค. 

 

ํ‘œ 2. GPT-4์— ์˜ํ•ด ํ‰๊ฐ€๋œ ์ข…ํ•ฉ ์Šค์ฝ”์–ด

 

 ์ด ์ œ์•ˆ๋œ ํ‰๊ฐ€ ํ”„๋ ˆ์ž„์›Œํฌ๋Š” ์ฑ—๋ด‡์„ ํ‰๊ฐ€ํ•˜๊ธฐ ์œ„ํ•œ ์ž ์žฌ๋ ฅ์„ ๋ณด์—ฌ์ฃผ์ง€๋งŒ, ์•„์ง LLM์ด hallucinate๋ฅผ ์ผ์œผํ‚ค๋Š” ๊ฒƒ์ฒ˜๋Ÿผ ์•„์ง ์™„๋ฒฝํ•œ ๋ฐฉ๋ฒ•์€ ์•„๋‹ˆ๋‹ค. ์ณ‡๋ด‡์„ ์œ„ํ•œ ์ข…ํ•ฉ์ ์ด๊ณ , ๊ธฐ์ค€ํ™” ๋œ ํ‰๊ฐ€ ์‹œ์Šคํ…œ์€ ์ถ”๊ฐ€์ ์ธ ์—ฐ๊ตฌ๋ฅผ ํ•„์š”๋กœ ํ•˜๋Š” open question์œผ๋กœ ๋‚จ์•„์žˆ๋‹ค.

 

Limitations

 

 ๋‹ค๋ฅธ LLM๊ณผ ์œ ์‚ฌํ•˜๊ฒŒ Vicuna๋Š” ํŠน์ • ํ•œ๊ณ„๋ฅผ ๊ฐ€์ง„๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด ์ถ”๋ก  ๋˜๋Š” ์ˆ˜ํ•™ task์— ๋Œ€ํ•ด ๋ณ„๋กœ ์ข‹์ง€ ๋ชปํ•˜๊ณ , ์ž์‹ ์˜ output์˜ ์‚ฌ์‹ค์  ์ •ํ™•๋„๋ฅผ ํƒ์ง€ํ•˜๋Š”๋ฐ ์–ด๋ ค์›€์„ ๊ฒช๋Š”๋‹ค. ๊ทธ๋ฆฌ๊ณ  ์•„์ง ์ถฉ๋ถ„ํ•˜๊ฒŒ safety๋ฅผ ๋ณด์žฅํ•˜๊ฑฐ๋‚˜ ์ž ์žฌ์  toxiciry ๋˜๋Š” bias๋ฅผ ์™„ํ™”ํ•˜๋„๋ก ์ตœ์ ํ™”ํ•˜์ง€ ์•Š์•˜๋‹ค. ๊ทธ๋Ÿผ์—๋„ ๋ถˆ๊ตฌํ•˜๊ณ  Vicuna๊ฐ€ ์ด๋Ÿฌํ•œ ํ•œ๊ณ„์ ๋“ค์„ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•œ ํ–ฅํ›„ ์—ฐ๊ตฌ์˜ ์‹œ์ž‘์ ์œผ๋กœ ์—ฌ๊ฒจ์งˆ ๊ฒƒ์ด๋ผ๊ณ  ์˜ˆ์ƒํ•œ๋‹ค.

 

 

 

 

์ถœ์ฒ˜

https://lmsys.org/blog/2023-03-30-vicuna/

 

Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality | LMSYS Org

<p>We introduce Vicuna-13B, an open-source chatbot trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT. Preliminary evaluation ...

lmsys.org