Custom 4-bit Finetuning 5-7 times faster inference than QLora
pinned#13 opened over 1 year ago
by
rmihaylov
Example Use
pinned
30
#1 opened over 1 year ago
by
Supreeth
429 Client Error: Too Many Requests for url
1
#123 opened 2 months ago
by
prasannaJK8
Interview request: genAI evaluation & documentation
#122 opened 3 months ago
by
evatang
Finetuning with Bengali Dataset Fails : Need Urgent Help ?
#121 opened 6 months ago
by
uknowWho42
Vision to text capabilities
#120 opened 7 months ago
by
kmewhort
ValueError: Error raised by inference API: Cannot override task for LLM models
1
#119 opened 7 months ago
by
joangonzaleezzz
[AUTOMATED] Model Memory Requirements
#118 opened 8 months ago
by
model-sizer-bot
[AUTOMATED] Model Memory Requirements
#117 opened 8 months ago
by
model-sizer-bot
422 Unprocessable Entity when trying to use conversational API
#116 opened 8 months ago
by
RandomDebugGuy
torch.cuda.OutOfMemoryError: CUDA out of memory.
1
#115 opened 10 months ago
by
SlyGoblin
Update for langchain
#113 opened 10 months ago
by
mbinkamran
Update README.md
#112 opened 11 months ago
by
LauraSophie
Can Falcon have supported UTF-8?
#110 opened about 1 year ago
by
vutruc
No package metadata was found for bitsandbytes
10
#109 opened about 1 year ago
by
Fyutong
Adding Evaluation Results
#108 opened about 1 year ago
by
leaderboard-pr-bot
Compare Multiple Document using this model.
#104 opened about 1 year ago
by
Pitambarmuduli
Upload model
#103 opened about 1 year ago
by
dookplaza
Reasearch team in Nile university.
#101 opened about 1 year ago
by
kirollossaleh
This is an instruct model, which may not be ideal for further finetuning.
1
#100 opened about 1 year ago
by
rakeshda
tiiuae/falcon-7b-instruct does not appear to have a file named config.json
#99 opened about 1 year ago
by
robotrage
falcon-7b-instruct responds with weird and short answers ?
#98 opened about 1 year ago
by
olsi8
model not generating text
#97 opened about 1 year ago
by
airedwin
Trying to deploy this using Inference Endpoints
1
#96 opened about 1 year ago
by
Kolibri753
Code for using inference api
#94 opened about 1 year ago
by
Karthik2003
Inference Api
#93 opened about 1 year ago
by
Karthik2003
The correctness of the result using transformers apply_chat_template
1
#92 opened about 1 year ago
by
Annorita
Request: DOI
#91 opened about 1 year ago
by
binbin888
How is falcon able to generate indic token , when it is not trained on Indic languages. even tokenizer.json doest have any Indic tokens.
#90 opened about 1 year ago
by
Sibadatta
How to solve this issue?
#87 opened about 1 year ago
by
Xiaogeng-SheltonLiu
Update generation_config.json
1
#86 opened about 1 year ago
by
nkasmanoff
Adding `safetensors` variant of this model
#84 opened about 1 year ago
by
bharathrajcl
Hardware requirements on falcon models: 7B, 40B, 180B
2
#83 opened about 1 year ago
by
its-eric-liu
Switching Falcon to other language
#82 opened about 1 year ago
by
wokalove
Update config.json
#81 opened about 1 year ago
by
Daniil-plotnikov
'num_return_sequences' & 'num_beams' can't be changed in inference API calls
#80 opened about 1 year ago
by
mrscoopers
Adding `safetensors` variant of this model
#79 opened about 1 year ago
by
bikalnetomi
Facing Issues with Model Output and Inference Times
1
#78 opened about 1 year ago
by
ankity09
License file
#77 opened about 1 year ago
by
jsaurabh
Share your recommended configurations for speed.
#76 opened over 1 year ago
by
archonlith
Adding `safetensors` variant of this model
#75 opened over 1 year ago
by
Shridharalve
Use input attention mask instead of casual mask in attention
#74 opened over 1 year ago
by
CyberZHG
Merge SentenceTransformer with Falcon-7b-instruct
#73 opened over 1 year ago
by
Alkahwaji
integration issue with Langchain csv agent
3
#71 opened over 1 year ago
by
DanCher
Issue with Falcon LLM while trying to use it on AWS EC2 Inferentia 2.8xlarge Instance
#70 opened over 1 year ago
by
AmlanSamanta
while giving a input but getting the wrong output for the particular input
#69 opened over 1 year ago
by
saidheer
Getting message Killed when loading on multi-gpu
2
#68 opened over 1 year ago
by
jurecucek
falcon in node.js
#67 opened over 1 year ago
by
othman95
falcon-7b-instruct is answering out of context
#66 opened over 1 year ago
by
kvmukilan
How to use the CoreML model?
5
#65 opened over 1 year ago
by
yyjhao