This is a d-Matrix functional reference of the EleutherAI/gpt-j-6b model.
The reference provides the following functional configurations:
Configuration | Explanation |
---|---|
BASELINE |
a reference functionally equivalent to the original model |
BASIC |
all linear algebraic operands quantized to BFP16-64 , and all other operations transformed to approximated kernel simulations |
Usage
Install d-Matrix ML Tools first.
pip install dmx-mltools
The following is an example model and its evaluation.
from mltools.dmx import pipeline
pipe = pipeline(
task="text-generation",
model="d-matrix/gpt-j-6b",
dmx_config="BASELINE", # see above for other variants
)
results = pipe.evaluate(
metric="d-matrix/dmx_perplexity",
dataset="wikitext",
dataset_version="wikitext-2-raw-v1",
)
Evaluation results
- Downloads last month
- 502
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.