Go to homeJozu logo
BACK

ast-finetuned-audioset-10-10-0.4593

:

Use the Pull Tag button to download this ModelKit.

Or, read our KitOps documentation to learn how to use kit unpack --filter to download only the components you need.

Package
Name
ast-finetuned-audioset-10-10-0.4593
Version
1.0.0
Authors
MIT
Description
Audio Spectrogram Transformer (AST) model fine-tuned on AudioSet. The Audio Spectrogram Transformer is equivalent to ViT, but applied on audio. Audio is first turned into an image (as a spectrogram), after which a Vision Transformer is applied. The model gets state-of-the-art results on several audio classification benchmarks.
Model
Name
model.safetensors
Path
model.safetensors
License
BSD-3 clause
Parts
Path: config.json
Type:
Path: preprocessor_config.json
Type:
Path: pytorch_model.bin
Type:
Docs
README.md
Readme file.