metadata
license: cc-by-nc-4.0
language:
- ko
- en
tags:
- 42dot
- PLM
42dot_LLM-PLM-1.3B_GGUF
- Model creator: 42dot
- Original model: 42dot_LLM-PLM-1.3B
Description
This repository provides GGUF quantized versions of the 42dot model.
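Files
Links to the files are provided; if you need them, leave a like and download what you need.
- Original GGUF file
- Q4 and Q8 quantized files
I considered uploading other models as well, but since they lack a solid basis, I do not plan to.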
Usage
See the original model's link for usage instructions.
Usage sample with Llama.cpp
For simple inference, use a command similar to
./main -m gguf-q4_k_m.gguf --temp 0 --top-k 4 --prompt "who was Joseph Weizenbaum?"
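The command above assumes that the `main` binary and the model file both sit in the current directory. As a minimal sketch, the wrapper below checks for both before running the same command. The function name `run_inference` and the variables `LLAMA_BIN` and `MODEL` are illustrative assumptions and are not part of Llama.cpp.

```shell
#!/bin/sh
# Hypothetical wrapper around the sample inference command.
# LLAMA_BIN and MODEL are illustrative names; adjust them to your setup.
LLAMA_BIN="${LLAMA_BIN:-./main}"
MODEL="${MODEL:-gguf-q4_k_m.gguf}"

run_inference() {
    prompt="$1"
    # Fail early with a clear message if the binary is missing.
    if [ ! -x "$LLAMA_BIN" ]; then
        echo "binary not found or not executable: $LLAMA_BIN" >&2
        return 1
    fi
    # Fail early if the model file has not been downloaded yet.
    if [ ! -f "$MODEL" ]; then
        echo "model file not found: $MODEL" >&2
        return 1
    fi
    # Same flags as the sample command (temperature 0, top-k 4).
    "$LLAMA_BIN" -m "$MODEL" --temp 0 --top-k 4 --prompt "$prompt"
}
```

With the function defined, `run_inference "who was Joseph Weizenbaum?"` runs the sample prompt, and setting `MODEL` to another downloaded file switches the quantization.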
Tokenization sample with Llama.cpp
To get a list of tokens, use a command similar to
./tokenization -m gguf-q4_k_m.gguf --prompt "who was Joseph Weizenbaum?"
Embedding sample with Llama.cpp
Text embeddings are calculated with a command similar to
./embedding -m gguf-q4_k_m.gguf --prompt "who was Joseph Weizenbaum?"
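One common use of such embeddings is comparing two texts. The sketch below computes the cosine similarity of two embeddings using `awk`. It assumes that each embedding has been saved to its own file and that the file contains only whitespace-separated floats; this is an assumption about the output format, so check it against your build. The function name `cosine_similarity` is hypothetical.

```shell
#!/bin/sh
# cosine_similarity FILE_A FILE_B
# Assumption: each file holds exactly one embedding as whitespace-separated floats.
cosine_similarity() {
    awk '
        # First file: collect its values into vector a.
        NR == FNR { for (i = 1; i <= NF; i++) a[++n] = $i; next }
        # Second file: collect its values into vector b.
        { for (i = 1; i <= NF; i++) b[++m] = $i }
        END {
            if (n == 0 || n != m) {
                print "empty input or dimension mismatch" > "/dev/stderr"
                exit 1
            }
            for (i = 1; i <= n; i++) {
                dot += a[i] * b[i]
                na  += a[i] * a[i]
                nb  += b[i] * b[i]
            }
            if (na == 0 || nb == 0) {
                print "zero vector" > "/dev/stderr"
                exit 1
            }
            printf "%.6f\n", dot / (sqrt(na) * sqrt(nb))
        }
    ' "$1" "$2"
}
```

For example, after saving two embeddings to the hypothetical files `a.txt` and `b.txt`, `cosine_similarity a.txt b.txt` prints a value between -1 and 1, with values closer to 1 indicating more similar texts.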
License
For the license, refer to the original model's Creative Commons Attribution-NonCommercial 4.0 (CC BY-NC 4.0) license.