Daily-Trend-Review

2023/03/07: Specialized LLMs, Prompt Engineering 등

hellcat 2023. 3. 7. 09:05

1. Specialized LLMs: ChatGPT, LaMDA, Galactica, Codex, Sparrow, and More

source: https://medium.com/towards-data-science/specialized-llms-chatgpt-lamda-galactica-codex-sparrow-and-more-ccccdd9f666f

LLM 동작 개선

prompt pretraining

Generic LM이 부족한 점

Alignment
Domain Specialization

LaMDA

모델을 확장하면 대화 품질이 어느 정도 향상되지만 근거나 안전성과 같은 지표는 개선할 수 없음
3가지 원칙
- Quality (sensibleness, specificity, interestingness의 평균)
  - sensibleness (does the model make sense and not contradict earlier dialog?),
  - specificity (is the model’s response specific to the given context?)
  - interestingness (does the model’s response capture the reader’s attention or arouse curiosity?).
- Safety
  - ability to avoid unintended or harmful results that contradict objectives derived from the Google AI Principles.
- Groundedness
  - producing responses that are factually correct and can be associated with authoritative, external sources.

Sparrow

Sparrow는 700억 개의 매개변수인 Chinchilla 모델(DPC라고 하는 대화 프롬프트 Chinchilla 또는 DPC라고 함)을 사용하여 초기화됨
온라인에서 증거를 찾기 위해 Sparrow는 "Search Query:" 문자열 다음에 텍스트 검색 쿼리를 출력하는 방법을 배움
그런 다음 Google에서 이 쿼리에 대한 응답을 검색하고 필터링하여 검색 결과를 얻습니다.

2. Prompt Engineering Guide

source: https://github.com/dair-ai/Prompt-Engineering-Guide?fbclid=IwAR3zJg1sywwkMGpXwp-JauFnP3-Av02uXE_wC81NiRUveBi5dXSsy-xrguU

프롬프트만 잘 작성하면 원하는 결과물을 얻을 수 있음

3. 프롬프트 엔지니어링과 인젝션: 새로운 IP 훔치기

source: https://medium.com/coxwave%EB%B8%94%EB%A1%9C%EA%B7%B8/%ED%94%84%EB%A1%AC%ED%94%84%ED%8A%B8-%EC%97%94%EC%A7%80%EB%8B%88%EC%96%B4%EB%A7%81%EA%B3%BC-%EC%9D%B8%EC%A0%9D%EC%85%98-%EC%83%88%EB%A1%9C%EC%9A%B4-ip-%ED%9B%94%EC%B9%98%EA%B8%B0-43588bde9c92

프롬프트를 얼마나 잘 썼느냐에 따라 생성된 출력 결과물의 품질이 결정됨
프롬프트 Injection은 해결해야 할 숙제임

4. ChatGPT 업무효율화 가이드

source: https://jehyunlee.github.io/2023/02/15/Python-General-16-chatgpt/

보고서 작성 OK
새로운 아이디어 탐색 OK
같은 내용을 다른 어조로 변화 OK
단락을 요약하거나 번역하기 OK
표 해석하기 OK
나머지 NOT OK (검증 필수)

5. Language Models: GPT and GPT-2

source: https://towardsdatascience.com/language-models-gpt-and-gpt-2-8bdb9867c50a?source=user_profile---------6----------------------------

모든 app을 위해 새로운 모델을 학습하는 것보다 Single LM을 학습하여 다양한 문제를 풀도록 하는 것이 더 유용함

6. The 2023 edition of the Machine Learning, AI and Data Landscape — a quick analysis

https://towardsdatascience.com/specialized-llms-chatgpt-lamda-galactica-codex-sparrow-and-more-ccccdd9f666f

'Daily-Trend-Review' 카테고리의 다른 글

2023/03/09: AI/ML landscape (0)	2023.03.08
2023/03/08: Disffusion Models Course, LM vs Thinking 등 (0)	2023.03.08
2023/03/06: LLaMA, OpenAI ChatGPT&Whisper APIs 등 (0)	2023.03.06
2023/03/05: Generative AI landscape (0)	2023.03.05
2023/03/03: OpenAI Triton & Pytorch 2.0 (0)	2023.03.03

현재글2023/03/07: Specialized LLMs, Prompt Engineering 등

AI, Quant 투자 공부

글쓰기 좋아하는 AI 엔지니어의 AI와 Quant 투자 스터디를 위한 공간

State of GPT, Generative-AI, mdd, ChatGPT, LLaMA-Adapter, 강환국, transformer, 삼프로tv, 거인의포트폴리오, 정채진프로, QLORA, vscode, llma, jupyter notebook, training, GPT, 퀀트투자, llm, etf, gpt-4,

Today :
Yesterday :

일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

AI, Quant 투자 공부

2023/03/07: Specialized LLMs, Prompt Engineering 등

1. Specialized LLMs: ChatGPT, LaMDA, Galactica, Codex, Sparrow, and More

LLM 동작 개선

Generic LM이 부족한 점

LaMDA

Sparrow

2. Prompt Engineering Guide

3. 프롬프트 엔지니어링과 인젝션: 새로운 IP 훔치기

4. ChatGPT 업무효율화 가이드

5. Language Models: GPT and GPT-2

6. The 2023 edition of the Machine Learning, AI and Data Landscape — a quick analysis

'Daily-Trend-Review' 카테고리의 다른 글

'Daily-Trend-Review'의 다른글

티스토리툴바

단축키

내 블로그

블로그 게시글

모든 영역

2023/03/07: Specialized LLMs, Prompt Engineering 등

1. Specialized LLMs: ChatGPT, LaMDA, Galactica, Codex, Sparrow, and More

LLM 동작 개선

Generic LM이 부족한 점

LaMDA

Sparrow

2. Prompt Engineering Guide

3. 프롬프트 엔지니어링과 인젝션: 새로운 IP 훔치기

4. ChatGPT 업무효율화 가이드

5. Language Models: GPT and GPT-2

6. The 2023 edition of the Machine Learning, AI and Data Landscape — a quick analysis

'Daily-Trend-Review' 카테고리의 다른 글

'Daily-Trend-Review'의 다른글

관련글

티스토리툴바

단축키

내 블로그

블로그 게시글

모든 영역