Women on Hugging Face

community

AI & ML interests

None defined yet.

Recent Activity

WomenonHuggingFace's activity

AdinaY 
posted an update 1 day ago
AdinaY 
posted an update 3 days ago
view post
Post
2429
🔥So many exciting releases coming from the Chinese community this month!
zh-ai-community/2025-january-6786b054f492fb223591269e

LLMs:
✨ Qwen2.5 -1M by Alibaba
Qwen/qwen25-1m-679325716327ec07860530ba
✨ InternLM3-8B-Instruct by Shanghai AI Lab
internlm/internlm3-8b-instruct
✨ MiniMax-Text-01 by MiniMax AI
MiniMaxAI/MiniMax-Text-01
✨ RWKV-7 by BlinkDL -- RNN + Transformer 👀
BlinkDL/rwkv-7-world
✨ DeepSeek-R1 by DeepSeek -- THE ONE 🙌
https://huggingface.co/deepseek-ai
✨ Baichuan-M1-14B by Baichuan - Medical 🩺
baichuan-inc/Baichuan-M1-14B-Base
✨ Qwen2.5-Math-PRM by Alibaba - Math 🔢
Qwen/Qwen2.5-Math-PRM-7B

Code:
✨ Tare by Bytedance
https://trae.ai

TTS:
✨ T2A-01-HD by MiniMax AI
https://hailuo.ai/audio
✨ LLaSA by HKUST Audio
HKUSTAudio/Llasa-3B

MLLM:
✨ Kimi k1.5 by Moonshot AI
https://kimi.ai
✨ MiniCPM-o-2_6 by OpenBMB
openbmb/MiniCPM-o-2_6
✨ Sa2VA-4B by ByteDance
ByteDance/Sa2VA-4B
✨ VideoLLaMA 3 by Alibaba DAMO
DAMO-NLP-SG/videollama3-678cdda9281a0e32fe79af15
✨ LLaVA-Mini by Chinese Academy of Sciences
ICTNLP/llava-mini-llama-3.1-8b
✨Hunyuan-7B by Tencent
tencent/Hunyuan-7B-Instruct
✨ Hunyuan 3D 2.0 by Tencent
tencent/Hunyuan3D-2
✨MiniMax-VL-01 by MiniMax AI - A non transformer based VLM 👀
MiniMaxAI/MiniMax-VL-01

Agent:
✨ UI-TARS by Bytedance
bytedance-research/UI-TARS-7B-SFT
✨ GLM-PC by Zhipu AI
https://cogagent.aminer.cn

Dataset:
✨ Fineweb-Edu-Chinese by Opencsg
opencsg/Fineweb-Edu-Chinese-V2.1
✨ Multimodal_textbook by Alibaba
DAMO-NLP-SG/multimodal_textbook
✨ MME-Finance by Hithink AI
·
AdinaY 
posted an update 5 days ago
AdinaY 
posted an update 6 days ago
AdinaY 
posted an update 7 days ago
AdinaY 
posted an update 8 days ago
view post
Post
2803
What happened yesterday in the Chinese AI community? 🚀

T2A-01-HD 👉 https://hailuo.ai/audio
MiniMax's Text-to-Audio model, now in Hailuo AI, offers 300+ voices in 17+ languages and instant emotional voice cloning.

Tare 👉 https://www.trae.ai/
A new coding tool by Bytedance for professional developers, supporting English & Chinese with free access to Claude 3.5 and GPT-4 for a limited time.

DeepSeek-R1 Series 👉 deepseek-ai/deepseek-r1-678e1e131c0169c0bc89728d
Open-source reasoning models with MIT license by DeepSeek.

Kimi K 1.5 👉 https://github.com/MoonshotAI/Kimi-k1.5 | https://kimi.ai/
An O1-level multi-modal model by MoonShot AI, utilizing reinforcement learning with long and short-chain-of-thought and supporting up to 128k tokens.

And today…

Hunyuan 3D-2.0 👉 tencent/Hunyuan3D-2
A SoTA 3D synthesis system for high-res textured assets by Tencent Hunyuan , with open weights and code!

Stay tuned for more updates 👉 https://huggingface.co/zh-ai-community
AdinaY 
posted an update 8 days ago
view post
Post
856
Hunyuan 3D 2.0🔥 a synthesis system for high-res textured 3D assets released by Tencent Hunyuan

2 key components: Hunyuan3D-DiT (geometry) and Hunyuan3D-Paint (textures) work together, achieving highly realistic 3D results.

Model: tencent/Hunyuan3D-2
Demo coming soon!
AdinaY 
posted an update 9 days ago
view post
Post
2782
BIG release by DeepSeek AI🔥🔥🔥

DeepSeek-R1 & DeepSeek-R1-Zero: two 660B reasoning models are here, alongside 6 distilled dense models (based on Llama & Qwen) for the community!
https://huggingface.co/deepseek-ai
deepseek-ai/DeepSeek-R1

✨ MIT License : enabling distillation for custom models
✨ 32B & 70B models match OpenAI o1-mini in multiple capabilities
✨ API live now! Access Chain of Thought reasoning with model='deepseek-reasoner'
AdinaY 
posted an update 13 days ago
AdinaY 
posted an update 14 days ago
AdinaY 
posted an update 15 days ago
view post
Post
3090
MiniMax, the company behind Hailuo_AI, has joined the open source community by releasing both models and demos of MiniMax-Text-01 & MiniMax-VL-01🔥
- Model
MiniMaxAI/MiniMax-VL-01
MiniMaxAI/MiniMax-Text-01
- Demo
MiniMaxAI/MiniMax-VL-01
MiniMaxAI/MiniMax-Text-01

✨ MiniMax-text-01:
- 456B with 45.9B activated per token
- Combines Lightning Attention, Softmax Attention, and MoE for optimal performance
- Training context up to 1M tokens, inference handles 4M tokens

✨ MiniMax-VL-01:
- ViT-MLP-LLM framework ( non-transformer👀)
- Handles image inputs from 336×336 to 2016×2016
- 694M image-caption pairs + 512B tokens processed across 4 stages
  • 1 reply
·
AdinaY 
posted an update 15 days ago
view post
Post
3175
MiniCPM-o2.6 🔥 an end-side multimodal LLMs released by OpenBMB from the Chinese community
Model: openbmb/MiniCPM-o-2_6
✨ Real-time English/Chinese conversation, emotion control and ASR/STT
✨ Real-time video/audio understanding
✨ Processes up to 1.8M pixels, leads OCRBench & supports 30+ languages
meg 
posted an update 16 days ago
view post
Post
2956
💫...And we're live!💫 Seasonal newsletter from ethicsy folks at Hugging Face, exploring the ethics of "AI Agents"
https://huggingface.co/blog/ethics-soc-7
Our analyses found:
- There's a spectrum of "agent"-ness
- *Safety* is a key issue, leading to many other value-based concerns
Read for details & what to do next!
With @evijit , @giadap , and @sasha
AdinaY 
posted an update 19 days ago
BrigitteTousi 
posted an update 20 days ago
view post
Post
1059
Community fine-tuned models are more carbon efficient than the models they are derived from! 🥳🌿

@alozowski @clefourrier @SaylorTwift @albertvillanova evaluated CO₂ emissions associated with model inference for over 3000 models on the Open LLM Leaderboard. Interesting trends and new insights emerged...👀

Blog Post: https://huggingface.co/blog/leaderboard-emissions-analysis

Leaderboard: open-llm-leaderboard/open_llm_leaderboard
AdinaY 
posted an update 23 days ago
AdinaY 
posted an update about 1 month ago
nityan 
posted an update about 1 month ago
view post
Post
1632
#001 | A journey into open-source Hugging Face Models on Azure AI

December is the month for New Year resolutions - and this year I am determined to write more on Hugging Face. I kept putting this off thinking I wanted to have time to craft perfect long-form articles, but then I discovered we can do quick posts. So why wait till January?

I am a PhD, a Polyglot, a Parent, a Visual Storyteller, a Community Builder - and an AI Advocate at Microsoft. However, if I look back on my 25+ years in tech, what I love most is to help people learn by making complex concepts feel more accessible and actionable regardless of your background or expertise. And in 2025, I want to use a #NityaLearnsAI tagline as a way to share my learning journey, explore the vast space of AI tools and technologies, amplify our open-source community and put the fun back in fundamentals. I hope you find it useful and will join me!

My first post is on this Microsoft Ignite theater session delivered in Nov:
https://ignite.microsoft.com/en-US/sessions/THR502?source=sessions It was not recorded but can find the slides here: https://speakerdeck.com/nitya/thr502-journey-into-open-source-hugging-face-models-on-azure-ai - and the illustrated guide attached below summarizes the talk in one big picture.

At the core, this is about my growing interest in **Model Choice** and learning more about not just frontier models but the much larger ecosystem of open-source variants and the community creators who build them. See:

1. Oct / The Future of AI is model choice / https://techcommunity.microsoft.com/blog/aiplatformblog/the-future-of-ai-is-model-choice---from-structured-process-to-seamless-platform/4284091
2. Sep / HF Models Recap / https://techcommunity.microsoft.com/blog/aiplatformblog/new-hugging-face-models-on-azure-ai-phi-3-variants-from
3. Aug / HF Models Recap / https://techcommunity.microsoft.com/blog/aiplatformblog/new-hugging-face-models-on-azure-ai-multilingual-slm-and-biomed--july-2024-updat/4211881
  • 1 reply
·
AdinaY 
posted an update about 1 month ago
view post
Post
3028
QvQ-72B-Preview🎄 an open weight model for visual reasoning just released by Alibaba_Qwen team
Qwen/qvq-676448c820912236342b9888
✨ Combines visual understanding & language reasoning.
✨ Scores 70.3 on MMMU
✨ Outperforms Qwen2-VL-72B-Instruct in complex problem-solving
AdinaY 
posted an update about 1 month ago
view post
Post
553
Megrez-3B-Omni 🔥 an on-device multimodal LLM by Infinigence AI, another startup emerging from the Tsinghua University ecosystem.
Model: Infinigence/Megrez-3B-Omni
Demo: Infinigence/Megrez-3B-Omni
✨Supports analysis of image, text, and audio modalities
✨Leads in bilingual speech ( English & Chinese ) input, multi-turn conversations, and voice-based queries
✨Outperforms in scene understanding and OCR across major benchmarks