UsefulSensors/moonshine

Automatic Speech Recognition

Keras

ONNX

English


keveman committed on Oct 20, 2024

Commit

b78ccbb

verified ·

1 Parent(s): 556fbd4

Update README.md

README.md +55 -0

@@ -62,3 +62,58 @@ In addition, the sequence-to-sequence architecture of the model makes it prone t
 We anticipate that Moonshine models’ transcription capabilities may be used for improving accessibility tools, especially for real-time transcription. The real value of beneficial applications built on top of Moonshine models suggests that the disparate performance of these models may have real economic implications.
 There are also potential dual-use concerns that come with releasing Moonshine. While we hope the technology will be used primarily for beneficial purposes, making ASR technology more accessible could enable more actors to build capable surveillance technologies or scale up existing surveillance efforts, as the speed and accuracy allow for affordable automatic transcription and translation of large volumes of audio communication. Moreover, these models may have some capabilities to recognize specific individuals out of the box, which in turn presents safety concerns related both to dual use and disparate performance. In practice, we expect that the cost of transcription is not the limiting factor of scaling up surveillance projects.

+## Setup
+* Install `uv` for Python environment management
+  - Follow instructions [here](https://github.com/astral-sh/uv)
+* Create and activate virtual environment
+  ```shell
+  uv venv env_moonshine
+  source env_moonshine/bin/activate
+  ```
+* Install the `useful-moonshine` package from the GitHub repository
+  ```shell
+  uv pip install useful-moonshine@git+https://github.com/usefulsensors/moonshine.git
+  ```
+  The `moonshine` inference code is written in Keras and can run on any backend
+  that Keras supports. The command above installs Moonshine with the PyTorch
+  backend. To run the provided inference code, instruct Keras to use
+  the PyTorch backend by setting an environment variable:
+  ```shell
+  export KERAS_BACKEND=torch
+  ```
+  To use the TensorFlow backend, install Moonshine with the `tensorflow` extra and set the backend accordingly:
+  ```shell
+  uv pip install "useful-moonshine[tensorflow]@git+https://github.com/usefulsensors/moonshine.git"
+  export KERAS_BACKEND=tensorflow
+  ```
+  To use the JAX backend, run the following:
+  ```shell
+  uv pip install "useful-moonshine[jax]@git+https://github.com/usefulsensors/moonshine.git"
+  export KERAS_BACKEND=jax
+  # Use useful-moonshine[jax-cuda] for jax on GPU
+  ```
+* Test transcribing an audio file
+  ```shell
+  python
+  >>> import moonshine
+  >>> moonshine.transcribe(moonshine.ASSETS_DIR / 'beckett.wav', 'moonshine/tiny')
+  ['Ever tried ever failed, no matter try again, fail again, fail better.']
+  ```
+  * The first argument is the path to an audio file; the second is the name of a Moonshine model. `moonshine/tiny` and `moonshine/base` are the currently available models.
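
The setup steps above can also be driven from a single Python script instead of shell exports and an interactive session. What follows is a minimal sketch rather than code from the repository: the helper names `select_backend` and `transcribe_file` are hypothetical, and only `moonshine.transcribe`, `moonshine.ASSETS_DIR`, the two model names, and the three `KERAS_BACKEND` values are taken from the README. It assumes the matching `useful-moonshine` variant is already installed in the active environment.

```python
import os

# Backend names as listed in the setup instructions.
SUPPORTED_BACKENDS = ("torch", "tensorflow", "jax")


def select_backend(name="torch"):
    """Set KERAS_BACKEND for this process (hypothetical helper).

    Keras reads this variable when it is first imported, so call this
    before anything imports keras or moonshine.
    """
    if name not in SUPPORTED_BACKENDS:
        raise ValueError(
            f"unsupported backend {name!r}; expected one of {SUPPORTED_BACKENDS}"
        )
    os.environ["KERAS_BACKEND"] = name
    return name


def transcribe_file(path=None, model="moonshine/tiny"):
    """Transcribe one audio file (hypothetical helper).

    Falls back to the bundled beckett.wav sample when no path is given.
    """
    # Imported lazily so that a prior select_backend() call takes effect.
    import moonshine

    if path is None:
        path = moonshine.ASSETS_DIR / "beckett.wav"
    return moonshine.transcribe(path, model)
```

Calling `select_backend("torch")` and then `transcribe_file()` mirrors the `export KERAS_BACKEND=torch` step followed by the interactive test. The lazy import keeps the backend choice and the model call in one file without depending on import order at the top of the script.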