Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
tsinghua-ee
/
SALMONN
like
41
Follow
Electronic Engineering @Tsinghua University
9
Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv:
2310.13289
arxiv:
2406.15704
License:
apache-2.0
Model card
Files
Files and versions
Community
2
main
SALMONN
5 contributors
History:
32 commits
cz277
AdinaY
HF staff
Add paper link (
#2
)
5b693e4
verified
5 months ago
beats
chore: release v1
about 1 year ago
other_third-party_licenses
chore: release v1
about 1 year ago
qformer
chore: release v1
about 1 year ago
resource
chore: release v1
about 1 year ago
.gitattributes
56 Bytes
chore: release v1
about 1 year ago
.gitignore
3.1 kB
chore: release v1
about 1 year ago
LICENSE
11.3 kB
chore: release v1
about 1 year ago
README.md
6.08 kB
Add paper link (#2)
5 months ago
cli_inference.py
1.98 kB
chore: add lora alpha
about 1 year ago
model.py
9.79 kB
chore: release v1
about 1 year ago
requirements.txt
160 Bytes
Create requirements.txt
about 1 year ago
salmonn_v1.pth
pickle
Detected Pickle imports (4)
"collections.OrderedDict"
,
"torch.LongStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
What is a pickle import?
400 MB
LFS
Upload salmonn_v1.pth
about 1 year ago
web_demo.py
7.32 kB
chore: change sac prompot
about 1 year ago