metadata
license: cc-by-nc-4.0
42dot_LLM-PLM-1.3B_GGUF
- Model creator: 42dot
- Original model: 42dot_LLM-PLM-1.3B
Description
GGUF quantized (lightweight) versions of the 42dot model are provided here.
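Files
The files are linked below. If you need them, please leave a like and take what you need.
- Original GGUF file
- Q4 and Q8 quantized files
I considered uploading other variants as well, but they lack a solid basis, so I have decided not to.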
Usage
See the original model link for usage instructions.
Usage sample with Llama.cpp
For simple inferencing, use a command similar to
./main -m gguf-q4_k_m.gguf --temp 0 --top-k 4 --prompt "who was Joseph Weizenbaum?"
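The command above can be wrapped in a small shell function so that a missing model file is reported before llama.cpp is started. This is only a minimal sketch: `run_inference` and the `LLAMA_MAIN` variable are hypothetical names introduced for illustration, and the flags are copied unchanged from the command above.

```shell
#!/bin/sh
# Hypothetical helper, not part of llama.cpp.
# LLAMA_MAIN is an assumed variable pointing at the llama.cpp binary used above.
LLAMA_MAIN="${LLAMA_MAIN:-./main}"

run_inference() {
    model="$1"    # path to a GGUF file, e.g. gguf-q4_k_m.gguf
    prompt="$2"   # prompt text

    # Fail early if the model file has not been downloaded yet.
    if [ ! -f "$model" ]; then
        echo "model file not found: $model" >&2
        return 1
    fi

    # Same flags as the sample command above.
    "$LLAMA_MAIN" -m "$model" --temp 0 --top-k 4 --prompt "$prompt"
}
```

For example, `run_inference gguf-q4_k_m.gguf "who was Joseph Weizenbaum?"` runs the same inference as the sample command once the file is in the current directory.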
Tokenization sample with Llama.cpp
To get a list of tokens, use a command similar to
./tokenization -m gguf-q4_k_m.gguf --prompt "who was Joseph Weizenbaum?"
Embedding sample with Llama.cpp
Text embeddings are calculated with a command similar to
./embedding -m gguf-q4_k_m.gguf --prompt "who was Joseph Weizenbaum?"
License
For the original model's license, see Creative Commons Attribution-NonCommercial 4.0 (CC BY-NC 4.0).