Lecture ๐Ÿง‘โ€๐Ÿซ/Coursera

[Machine Learning] Machine Learning Algorithm Application

2023. 3. 28. 11:03

Prioritizing What to Work On

System Desing Example:

 

 ์ŠคํŒธ ๋ฉ”์ผ์„ ๋ถ„๋ฅ˜ํ•œ๋‹ค๊ณ  ํ•  ๋•Œ, ์ด๋ฉ”์ผ ์„ธํŠธ๊ฐ€ ์ฃผ์–ด์ง€๋ฉด ๊ฐ ์ด๋ฉ”์ผ์— ๋Œ€ํ•œ ๋ฒกํ„ฐ๋ฅผ ๋งŒ๋“ค์–ด์•ผ ํ•œ๋‹ค. ์ด ๋ฒกํ„ฐ์˜ ๊ฐ๊ฐ์˜ entry๋Š” ๋‹จ์–ด๋“ค์„ ๋‚˜ํƒ€๋‚ธ๋‹ค. ๋ฒกํ„ฐ๋Š” ์ผ๋ฐ˜์ ์œผ๋กœ ๋ฐ์ดํ„ฐ์…‹์—์„œ ํ”ํ•˜๊ฒŒ ๋ฐœ๊ฒฌ๋˜๋Š” ๋‹จ์–ด๋“ค์„ ๋ชจ์•„์„œ 10,000๊ฐœ์—์„œ 50,000๊ฐœ์˜ entry๋ฅผ ํฌํ•จํ•˜๊ณ  ์žˆ๋‹ค. ๋งŒ์•ฝ ์ด๋ฉ”์ผ์—์„œ ๋‹จ์–ด๊ฐ€ ์ฐพ์•„์ง€๋ฉด, ์ด์— ๋Œ€ํ•œ entry๋ฅผ 1๋กœ ํ•˜๊ณ , ์ฐพ์•„์ง€์ง€ ์•Š์œผ๋ฉด entry๋ฅผ 0์œผ๋กœ ํ•œ๋‹ค. $x$ ๋ฒกํ„ฐ๋“ค์ด ๋ชจ๋‘ ์ค€๋น„๋˜๋ฉด ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ํ•™์Šต์‹œํ‚ค๊ณ  ์ตœ์ข…์ ์œผ๋กœ ์ด๋ฉ”์ผ์— ์ ์šฉํ•ด์„œ ์ŠคํŒธ์ธ์ง€ ์•„๋‹Œ์ง€๋ฅผ ๋ถ„๋ฅ˜ํ•˜๋Š”๋ฐ ์‚ฌ์šฉํ•œ๋‹ค.

 

 

 ์–ด๋–ป๊ฒŒ ํ•˜๋ฉด ๋ถ„๋ฅ˜๊ธฐ์˜ ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚ฌ ์ˆ˜ ์žˆ์„๊นŒ?

 

  • ๋งŽ์€ ๋ฐ์ดํ„ฐ๋ฅผ ์ˆ˜์ง‘ํ•˜๊ธฐ
  • ์ •๊ตํ•œ feature ์‚ฌ์šฉ$($ex. ์ŠคํŒธ ๋ฉ”์ผ์˜ ์ด๋ฉ”์ผ ํ—ค๋” ์‚ฌ์šฉ$)$
  • ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ๋ฐœ์ „์‹œ์ผœ์„œ ์ž…๋ ฅ๊ฐ’์„ ๋‹ค๋ฅธ ๋ฐฉ์‹์œผ๋กœ ์ฒ˜๋ฆฌ$($์ŠคํŒธ ๋ฉ”์ผ์—์„œ misspelling์„ ์ธ์‹ํ•˜๊ฒŒ ํ•˜๊ธฐ$)$

 

 ์–ด๋– ํ•œ ์˜ต์…˜์ด ๋” ๋„์›€์ด ๋˜๋Š”์ง€๋Š” ๋‹จ์–ธํ•  ์ˆ˜ ์—†๋‹ค..

 

 

Error Analysis

 ๋จธ์‹ ๋Ÿฌ๋‹ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ์ถ”์ฒœ๋œ ๋ฐฉ๋ฒ•๋“ค์€ ๋‹ค์Œ๊ณผ ๊ฐ™๋‹ค.

 

  • ๊ฐ„๋‹จํ•œ ์•Œ๊ณ ๋ฆฌ์ฆ˜์—์„œ ์‹œ์ž‘ํ•ด์„œ ๋น ๋ฅด๊ฒŒ ๊ตฌํ˜„ํ•˜๊ณ  cross validation data์—์„œ ํ…Œ์ŠคํŠธ๋ฅผ ํ•œ๋‹ค.
  • learning curve๋ฅผ ๊ทธ๋ ค์„œ ๋” ๋งŽ์€ ๋ฐ์ดํ„ฐ ๋˜๋Š” ๋” ๋งŽ์€ feature๊ฐ€ ๋„์›€์ด ๋  ์ง€๋ฅผ ํŒŒ์•…ํ•œ๋‹ค.
  • cross validation set์˜ example์—์„œ ๋ฐœ์ƒํ•œ ์˜ค๋ฅ˜๋“ค์„ ๊ฒ€์‚ฌํ•˜๊ณ  ๊ฐ€์žฅ ๋งŽ์€ ์˜ค๋ฅ˜๊ฐ€ ๋ฐœ์ƒํ•˜๋Š” ๋ถ€๋ถ„์˜ ๊ฒฝํ–ฅ์„ ํŒŒ์•…ํ•œ๋‹ค.

 

 ์˜ˆ๋ฅผ ๋“ค์–ด 500๊ฐœ์˜ ์ด๋ฉ”์ผ example์—์„œ 100๊ฐœ์˜ ์ด๋ฉ”์ผ์„ ์ž˜๋ชป ๋ถ„๋ฅ˜ํ–ˆ๋‹ค๊ณ  ํ•ด๋ณด์ž. ๊ทธ๋Ÿฌ๊ณ  ์ด ์ž˜๋ชป ๋ถ„๋ฅ˜๋œ 100๊ฐœ์˜ ์ด๋ฉ”์ผ์„ ๋ถ„์„ํ•ด์„œ ์–ด๋–ค ์œ ํ˜•์˜ ์ด๋ฉ”์ผ๋“ค์ธ์ง€ ๋ถ„๋ฅ˜ํ•ด๋ณด์ž. ์ƒˆ๋กœ์šด ์‹ ํ˜ธ์™€ feature๋ฅผ ์‚ฌ์šฉํ•ด์„œ ์ด 100๊ฐœ์˜ ์ด๋ฉ”์ผ์„ ์˜ฌ๋ฐ”๋ฅด๊ฒŒ ๋ถ„๋ฅ˜ํ•˜๋„๋ก ๋„์›€์„ ์ฃผ๋„๋ก ํ•œ๋‹ค. ๋”ฐ๋ผ์„œ ๋Œ€๋ถ€๋ถ„์˜ ์ž˜๋ชป ๋ถ„๋ฅ˜๋œ ์ด๋ฉ”์ผ์€ ๋น„๋ฐ€๋ฒˆํ˜ธ๋ฅผ ํ›”์น˜๋ ค ํ•œ๋‹ค. ๊ทธ๋Ÿฐ ๋‹ค์Œ ํ•ด๋‹น ์ด๋ฉ”์ผ์— ํŠน์ •ํ•œ ๋ช‡ ๊ฐ€์ง€ feature๋ฅผ ์ฐพ์•„ ๋ชจ๋ธ์— ์ถ”๊ฐ€ํ•  ์ˆ˜ ์žˆ๋‹ค. ๋˜ํ•œ ์–ด๊ทผ์— ๋”ฐ๋ผ ๊ฐ ๋‹จ์–ด๋ฅผ ๋ถ„๋ฅ˜ํ•˜๋ฉด ์˜ค๋ฅ˜์œจ์ด ์–ด๋–ป๊ฒŒ ๋ณ€ํ•˜๋Š”์ง€ ํ™•์ธํ•  ์ˆ˜ ์žˆ๋‹ค.

 

 

 ์˜ค๋ฅ˜์œจ์„ ํ•˜๋‚˜์˜ ์‹ค์ˆ˜ ๊ฐ’์œผ๋กœ ๊ฐ€์ง€๋Š” ๊ฒƒ์€ ๋งค์šฐ ์ค‘์š”ํ•˜๋‹ค. ๊ทธ ์™ธ์— ์•Œ๊ณ ๋ฆฌ์ฆ˜์˜ ์„ฑ๋Šฅ์„ ํ‰๊ฐ€ํ•˜๋Š” ๊ฒƒ์€ ํž˜๋“ค๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด ๋‹จ์–ด์— ๋Œ€ํ•ด stemming์„ ์‚ฌ์šฉํ•˜๋ฉด 5%์˜ ์˜ค๋ฅ˜์œจ ๋Œ€์‹ ์— 3%์˜ ์˜ค๋ฅ˜์œจ์„ ๊ฐ–๊ฒŒ ๋œ๋‹ค๊ณ  ํ•˜๋ฉด ๋ชจ๋ธ์— ์ด stemming์„ ์ถ”๊ฐ€ํ•ด์•ผ ํ•œ๋‹ค. ํ•˜์ง€๋งŒ ๋Œ€๋ฌธ์ž์™€ ์†Œ๋ฌธ์ž๋ฅผ ๊ตฌ๋ถ„ํ–ˆ์„ ๋•Œ 3%์˜ ์˜ค๋ฅ˜์œจ ๋Œ€์‹ ์— 3.2%์˜ ์˜ค๋ฅ˜์œจ์„ ๊ฐ–๊ฒŒ ๋œ๋‹ค๋ฉด ์ด feature์„ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์€ ํ”ผํ•ด์•ผ ํ•œ๋‹ค. ๋”ฐ๋ผ์„œ ์ƒˆ๋กœ์šด ๊ฒƒ์„ ์‹œ๋„ํ•˜๊ณ , ์˜ค๋ฅ˜์œจ์— ๋Œ€ํ•œ ์ˆ˜์น˜๋ฅผ ์–ป๊ณ , ๊ฒฐ๊ณผ์— ๋”ฐ๋ผ ์ƒˆ๋กœ์šด feature๋ฅผ ์œ ์ง€ํ• ์ง€ ์—ฌ๋ถ€๋ฅผ ๊ฒฐ์ •ํ•ด์•ผ ํ•œ๋‹ค.

'Lecture ๐Ÿง‘โ€๐Ÿซ > Coursera' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€

[Machine Learning] Bias vs Variance  (0) 2023.03.27
[Machine Learning] Evaluating a Learning Algorithm  (0) 2023.03.27
[Machine Learning] Backpropagation in Practice  (0) 2023.03.27
[Machine Learning] Cost Function & Backpropagation  (0) 2023.03.26
[Machine Learning] Neural Networks  (0) 2023.03.20
'Lecture ๐Ÿง‘โ€๐Ÿซ/Coursera' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€
  • [Machine Learning] Bias vs Variance
  • [Machine Learning] Evaluating a Learning Algorithm
  • [Machine Learning] Backpropagation in Practice
  • [Machine Learning] Cost Function & Backpropagation
Cartinoe
Cartinoe
Welcome! I'm a student studying about deep learning(NLP) ๐Ÿ˜‰ The goal of my study is to develop a competent LLM helping people!
  • faviconinstagram
  • faviconfacebook
  • favicongithub
  • faviconLinkedIn
Cartinoe's paper review
Cartinoe
Cartinoe
Cartinoe's paper review
Cartinoe
์ „์ฒด
์˜ค๋Š˜
์–ด์ œ
  • My Posting (141)
    • Paper Reading ๐Ÿ“œ (113)
      • Natural Language Processing (67)
      • Alignment Problem of LLM (11)
      • Computer Vision (4)
      • Deep Learning (6)
      • multimodal models (17)
      • Mathematics(์„ ํ˜•๋Œ€์ˆ˜, ํ™•๋ฅ ๊ณผ ํ†ต๊ณ„, ๋ฏธ.. (8)
    • Lecture ๐Ÿง‘โ€๐Ÿซ (16)
      • Hugging Face Course (1)
      • Coursera (15)
    • Insight ๐Ÿ˜Ž (10)
    • Research & Project ๐Ÿ”ฌ (2)

์ธ๊ธฐ ๊ธ€

์ตœ๊ทผ ๊ธ€

๊ณต์ง€์‚ฌํ•ญ

  • ๋ธ”๋กœ๊ทธ ๊ณต์ง€์‚ฌํ•ญ - ๋ชจ๋ฐ”์ผ ์ˆ˜์‹ ๊นจ์ง

ํƒœ๊ทธ

  • RLHF
  • Chinchilla
  • closed-source model
  • context length
  • LLAMA2
  • LLM
  • Evaluation Metric
  • Vicuna Evaluation
  • ChatGPT
  • Open-source
  • MT-Bench
  • LM
  • proprietary model
  • scaling law
  • GPT-4
  • open-source model
  • closed-source
  • transformer
  • Vicuna
  • context window
hELLO ยท Designed By ์ •์ƒ์šฐ.
Cartinoe
[Machine Learning] Machine Learning Algorithm Application
์ƒ๋‹จ์œผ๋กœ

ํ‹ฐ์Šคํ† ๋ฆฌํˆด๋ฐ”

๊ฐœ์ธ์ •๋ณด

  • ํ‹ฐ์Šคํ† ๋ฆฌ ํ™ˆ
  • ํฌ๋Ÿผ
  • ๋กœ๊ทธ์ธ

๋‹จ์ถ•ํ‚ค

๋‚ด ๋ธ”๋กœ๊ทธ

๋‚ด ๋ธ”๋กœ๊ทธ - ๊ด€๋ฆฌ์ž ํ™ˆ ์ „ํ™˜
Q
Q
์ƒˆ ๊ธ€ ์“ฐ๊ธฐ
W
W

๋ธ”๋กœ๊ทธ ๊ฒŒ์‹œ๊ธ€

๊ธ€ ์ˆ˜์ • (๊ถŒํ•œ ์žˆ๋Š” ๊ฒฝ์šฐ)
E
E
๋Œ“๊ธ€ ์˜์—ญ์œผ๋กœ ์ด๋™
C
C

๋ชจ๋“  ์˜์—ญ

์ด ํŽ˜์ด์ง€์˜ URL ๋ณต์‚ฌ
S
S
๋งจ ์œ„๋กœ ์ด๋™
T
T
ํ‹ฐ์Šคํ† ๋ฆฌ ํ™ˆ ์ด๋™
H
H
๋‹จ์ถ•ํ‚ค ์•ˆ๋‚ด
Shift + /
โ‡ง + /

* ๋‹จ์ถ•ํ‚ค๋Š” ํ•œ๊ธ€/์˜๋ฌธ ๋Œ€์†Œ๋ฌธ์ž๋กœ ์ด์šฉ ๊ฐ€๋Šฅํ•˜๋ฉฐ, ํ‹ฐ์Šคํ† ๋ฆฌ ๊ธฐ๋ณธ ๋„๋ฉ”์ธ์—์„œ๋งŒ ๋™์ž‘ํ•ฉ๋‹ˆ๋‹ค.