Edit model card

Model Details

Model Developers Seungyoo Lee (DopeorNope)

์ด ๋ชจ๋ธ์€ Mistral Base์˜ ์ƒˆ๋กœ์šด ์•„ํ‚คํ…์ณ์ด๋ฉฐ, 10.7B์˜ ํŒŒ๋ผ๋ฏธํ„ฐ๋กœ ๊ตฌ์„ฑ๋˜์—ˆ์Šต๋‹ˆ๋‹ค. (Solar๋‚˜, ์‹œ๋‚˜ํŠธ๋ผ ๋ฒ ์ด์Šค ๋ชจ๋ธ์ด ์•„๋‹™๋‹ˆ๋‹ค.)

์•ฝ 1.5B์˜ ํ† ํฐ์œผ๋กœ pretrain ๋˜์—ˆ์œผ๋‚˜, ์‹คํ—˜๋‹จ๊ณ„๋กœ ํ–ฅํ›„ ๋‹ค์‹œ ํ›ˆ๋ จ๋˜์–ด ์ƒˆ๋กญ๊ฒŒ ๋‚˜์˜ฌ ์˜ˆ์ •์ž…๋‹ˆ๋‹ค.

ํ…Œ์ŠคํŠธ์šฉ์œผ๋กœ ์˜ฌ๋ ค๋ด…๋‹ˆ๋‹ค.

Context length๊ฐ€ 32k ๊นŒ์ง€์ง€์› ๊ฐ€๋Šฅํ•œ ๋ชจ๋ธ์ด๋ฉฐ, ํ–ฅํ›„ ๋” ์™„๋ฒฝํ•˜๊ฒŒ ์„ค๊ณ„ํ•˜์—ฌ ์˜ฌ๋ฆฌ๋„๋ก ํ•˜๊ฒ ์Šต๋‹ˆ๋‹ค.

Downloads last month
1,144
Safetensors
Model size
10.8B params
Tensor type
F32
ยท
Inference API
Model is too large to load in Inference API (serverless). To try the model, launch it on Inference Endpoints (dedicated) instead.

Collection including DopeorNope/Mistralopithecus-v0.1-10.8B