Aurรฉlien-Morgan CLAUDON

Aurelien-Morgan

AI & ML interests

None yet

Recent Activity

Articles

Organizations

Aurelien-Morgan's activity

Reacted to merve's post with ๐Ÿ‘ about 7 hours ago
view post
Post
375
The authors of ColPali trained a retrieval model based on SmolVLM ๐Ÿค  vidore/colsmolvlm-alpha
TLDR;

- ColSmolVLM performs better than ColPali and DSE-Qwen2 on all English tasks

- ColSmolVLM is more memory efficient than ColQwen2 ๐Ÿ’—
upvoted an article 1 day ago
view article
Article

Letโ€™s make a generation of amazing image generation models

By burtenshaw โ€ข
โ€ข 28
Reacted to thomwolf's post with ๐Ÿง  2 days ago
Reacted to danielhanchen's post with ๐Ÿ”ฅ 3 days ago
Reacted to AdinaY's post with ๐Ÿ˜Ž 8 days ago
Reacted to PLB's post with ๐Ÿš€ 8 days ago
view post
Post
1843
โš ๏ธ People selling AI chatbots for websites hate us.
Add an open source chat assistant on your website in 5 minutes: https://github.com/phospho-app/ai-chat-bubble

How does it work ?
- You give an URL
- The AI assistant crawls the website content and embed it
- Add it to your frontend in one line of code
- People on your website can ask the assistant questions

Powered by BAAI/bge-small-en-v1.5 and Mistral AI
ยท
Reacted to m-ric's post with ๐Ÿ‘€ 9 days ago
view post
Post
377
๐— ๐—ฒ๐˜๐—ฎ ๐˜๐—ฒ๐—ฎ๐—บ ๐—ท๐˜‚๐˜€๐˜ ๐—ฑ๐—ฟ๐—ผ๐—ฝ๐—ฝ๐—ฒ๐—ฑ ๐˜๐—ต๐—ฒ ๐—ณ๐—ถ๐—ฟ๐˜€๐˜ ๐—ช๐—ฎ๐˜๐—ฒ๐—ฟ๐—บ๐—ฎ๐—ฟ๐—ธ๐—ถ๐—ป๐—ด ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น ๐˜๐—ต๐—ฎ๐˜ ๐—ป๐—ผ๐˜ ๐—ฒ๐—ฑ๐—ถ๐˜ ๐—ฐ๐—ฎ๐—ป ๐—ฏ๐—ฟ๐—ฒ๐—ฎ๐—ธ!๐Ÿ›ก๏ธ

๐Ÿค” Ever heard of watermarking? It's a technique that allows you to mark in an image its original source. It's our best shield against AI-generated deepfakes, or content stolen from artists! ๐ŸŽจ

๐ŸŽญ Watermarking systems are actually a pair of models: a watermark embedder that applies the watermark on the image, and its corresponding decoder that should detect the original watermark.

โ›” But current methods were very limited: they can only apply and detect the watermark on your image as a whole. So, if you're an attacker it's easy to break: just crop it! add text on top! or whatever, really, anything would work to break the watermark.

A team of researchers at Meta was not happy with this. ๐Ÿ˜ค

So to withstand real-world attacks, they decided to make a watermarking model that would also work on any sub-part of the image. It's a real paradigm shift: they consider watermarking not as an image classification task, but as an image segmentation task!

๐Ÿ—๏ธ ๐—”๐—ฟ๐—ฐ๐—ต๐—ถ๐˜๐—ฒ๐—ฐ๐˜๐˜‚๐—ฟ๐—ฒ
โ–ธ The "Embedder" (a variational autoencoder + embedder, 1.1M parameters in total) encodes a n-bit message into a watermark signal that is added to the original image
โ–ธ [Only during training] The "Augmenter" randomly distorts the image: masks parts, crops, resizes, compresses. It's basically torture at this point.
โ–ธ The "Extractor" (a vision transformer, or ViT, with 96M parameters) then re-extracts the message from the distorted image, by predicting a (1+n) vector per pixel to predict the watermarked parts and decode corresponding messages.

The performance blows existing models out of the water, they even created new tasks (segmentation-related) just to grok them!

Gerat work @pierrefdz and @tomsander1998 !

Paper here ๐Ÿ‘‰ Watermark Anything with Localized Messages (2411.07231)
Reacted to erikkaum's post with ๐Ÿ‘€๐Ÿ”ฅ 9 days ago
view post
Post
1667
A while ago I started experimenting with compiling the Python interpreter to WASM.

To build a secure, fast, and lightweight sandbox for code execution โ€” ideal for running LLM-generated Python code.

- Send code simply as a POST request
- 1-2ms startup times

Hack away:
https://github.com/ErikKaum/runner
Reacted to merve's post with ๐Ÿš€ 9 days ago
Reacted to maxiw's post with ๐Ÿค— 9 days ago
view post
Post
4576
I was curious to see what people post here on HF so I created a dataset with all HF Posts: maxiw/hf-posts

Some interesting stats:

Top 5 Authors by Total Impressions:
-----------------------------------
@merve : 171,783 impressions (68 posts)
@fdaudens : 135,253 impressions (81 posts)
@singhsidhukuldeep : 122,591 impressions (81 posts)
@akhaliq : 119,526 impressions (78 posts)
@MonsterMMORPG : 112,500 impressions (45 posts)

Top 5 Users by Number of Reactions Given:
----------------------------------------
@osanseviero : 1278 reactions
@clem : 910 reactions
@John6666 : 899 reactions
@victor : 674 reactions
@samusenps : 655 reactions

Top 5 Most Used Reactions:
-------------------------
โค๏ธ: 7048 times
๐Ÿ”ฅ: 5921 times
๐Ÿ‘: 4856 times
๐Ÿš€: 2549 times
๐Ÿค—: 2065 times
ยท
Reacted to ArthurZ's post with ๐Ÿ”ฅ 9 days ago
view post
Post
2267
Native tensor parallel has landed in transformers!!! https://github.com/huggingface/transformers/pull/34184 thanks a lot to the torch team for their support!

Contributions are welcome to support more models! ๐Ÿ”ฅ
Reacted to AdinaY's post with ๐Ÿ”ฅ 12 days ago
view post
Post
2516
Letโ€™s dive into the exciting releases from the Chinese community last week ๐Ÿ”ฅ๐Ÿš€
More details ๐Ÿ‘‰ https://huggingface.co/zh-ai-community

Code model:
โœจQwen 2.5 coder by Alibaba Qwen
Qwen/qwen25-coder-66eaa22e6f99801bf65b0c2f
โœจOpenCoder by InflyAI - Fully open code model๐Ÿ™Œ
infly/opencoder-672cec44bbb86c39910fb55e

Image model:
โœจHunyuan3D-1.0 by Tencent
tencent/Hunyuan3D-1

MLLM:
โœจJanusFlow by DeepSeek
deepseek-ai/JanusFlow-1.3B
deepseek-ai/JanusFlow-1.3B
โœจMono-InternVL-2B by OpenGVlab
OpenGVLab/Mono-InternVL-2B

Video model:
โœจCogVideoX 1.5 by ChatGLM
THUDM/CogVideoX1.5-5B-SAT

Audio model:
โœจFish Agent by FishAudio
fishaudio/fish-agent-v0.1-3b

Dataset:
โœจOPI dataset by BAAIBeijing
BAAI/OPI