YAML Metadata Warning: The pipeline tag "conversational" is not in the official list: text-classification, token-classification, table-question-answering, question-answering, zero-shot-classification, translation, summarization, feature-extraction, text-generation, text2text-generation, fill-mask, sentence-similarity, text-to-speech, text-to-audio, automatic-speech-recognition, audio-to-audio, audio-classification, audio-text-to-text, voice-activity-detection, depth-estimation, image-classification, object-detection, image-segmentation, text-to-image, image-to-text, image-to-image, image-to-video, unconditional-image-generation, video-classification, reinforcement-learning, robotics, tabular-classification, tabular-regression, tabular-to-text, table-to-text, multiple-choice, text-retrieval, time-series-forecasting, text-to-video, image-text-to-text, visual-question-answering, document-question-answering, zero-shot-image-classification, graph-ml, mask-generation, zero-shot-object-detection, text-to-3d, image-to-3d, image-feature-extraction, video-text-to-text, keypoint-detection, any-to-any, other

BigCodeLLama LFG πŸš€

Experimental CodeLlaMA frankenstein to see how it benchmarks

Models Merged with base codellama/CodeLlama-70b-hf

The following models were included in the merge:

  • ../CodeLlama-70b-hf
  • ../CodeLlama-70b-Instruct-hf
  • ../CodeLlama-70b-Python-hf

Configuration

The following YAML configuration was used to produce this model:

dtype: bfloat16
merge_method: passthrough
slices:
- sources:
  - layer_range: [0, 69]
    model:
      model:
        path: ../CodeLlama-70b-hf
- sources:
  - layer_range: [66, 76]
    model:
      model:
        path: ../CodeLlama-70b-Instruct-hf
- sources:
  - layer_range: [42, 66]
    model:
      model:
        path: ../CodeLlama-70b-hf
- sources:
  - layer_range: [13, 37]
    model:
      model:
        path: ../CodeLlama-70b-Python-hf
- sources:
  - layer_range: [10, 80]
    model:
      model:
        path: ../CodeLlama-70b-Instruct-hf

Stay tuned for GGUFs quants

Downloads last month
55
Safetensors
Model size
169B params
Tensor type
BF16
Β·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for nisten/BigCodeLlama-169b

Finetuned
(1)
this model

Space using nisten/BigCodeLlama-169b 1