What a week! A recap of everything you missed
merve/nov-22-releases-673fbbcfc1c97c4f411def07

Multimodal
> Mistral AI released Pixtral Large, a gigantic 124B open vision language model
> Llava-CoT (formerly known as Llava-o1) was released: a multimodal reproduction of the o1 model by PKU
> OpenGVLab released MMPR: a new multimodal reasoning dataset
> Jina released Jina-CLIP-v2: 0.98B multilingual multimodal embeddings
> Apple released AIMv2: new SotA vision encoders
LLMs
> AllenAI dropped a huge release of models, datasets and scripts for Tülu, a family of models based on Llama 3.1, aligned with SFT, DPO and a new technique they developed called RLVR (reinforcement learning with verifiable rewards)
> Jina released jina-embeddings-v3: new multilingual embeddings with longer context
> Hugging Face released SmolTalk: the synthetic dataset used to align SmolLM2 with supervised fine-tuning
> Microsoft released orca-agentinstruct-1M-v1: a gigantic instruction dataset of 1M synthetic instruction pairs
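Of the alignment methods listed above, DPO optimizes a simple closed-form preference loss. Here is a minimal sketch of that loss for a single preference pair; the function name and the toy log-probabilities are illustrative and not taken from the Tülu release:

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one preference pair.

    Inputs are summed token log-probabilities of the chosen and rejected
    responses under the trained policy and the frozen reference model.
    """
    chosen_ratio = policy_chosen_logp - ref_chosen_logp
    rejected_ratio = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_ratio - rejected_ratio)
    # -log(sigmoid(margin)), written as softplus(-margin)
    return math.log1p(math.exp(-margin))

# Toy numbers: the policy prefers the chosen response more strongly than
# the reference does, so the loss drops below log(2), the no-signal value.
loss = dpo_loss(-12.0, -15.0, -13.0, -14.0, beta=0.1)
```

With identical policy and reference log-probabilities the margin is zero and the loss is exactly log(2); training pushes the margin up and the loss down.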
Image Generation
> Black Forest Labs released FLUX.1 Tools: four new models for different image modifications and two LoRAs for image conditioning and better steering of generations
Lastly, Hugging Face released a new library, Observers: a lightweight SDK for monitoring interactions with AI APIs and easily storing and browsing them

$ pip install observers
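To make the "monitor and store interactions" idea concrete, here is a hypothetical sketch of what such an SDK does under the hood: wrap a client so every call is intercepted and appended to a local store for later browsing. All names here (`wrap`, `Record`, `FakeClient`) are invented for illustration and are not the actual observers API:

```python
import time
from dataclasses import dataclass

@dataclass
class Record:
    # One stored interaction: when it happened, what was asked, what came back.
    timestamp: float
    prompt: str
    response: str

class FakeClient:
    # Stand-in for a real AI API client (hypothetical, for illustration only).
    def complete(self, prompt):
        return f"echo: {prompt}"

def wrap(client, store):
    """Replace client.complete with a version that logs every call."""
    original = client.complete
    def observed(prompt):
        response = original(prompt)
        store.append(Record(time.time(), prompt, response))
        return response
    client.complete = observed
    return client

store = []
client = wrap(FakeClient(), store)
client.complete("hello")  # the call works as before, but is now recorded
```

After the call, `store` holds one `Record` that can be filtered, serialized, or browsed; a real SDK would persist records to a database or dataset instead of a Python list.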