Choms

choms

AI & ML interests

None yet

Recent Activity

liked a model 22 days ago

perplexity-ai/r1-1776

liked a Space 4 months ago

Qwen/Qwen2.5-Coder-Artifacts

liked a Space 4 months ago

stabilityai/stable-diffusion-3.5-large

Choms's activity

liked a model 22 days ago

perplexity-ai/r1-1776

Text Generation • Updated 14 days ago • 40.9k • 2.12k

liked 2 Spaces 4 months ago

Qwen2.5 Coder Artifacts

Generate code from a description

Stable Diffusion 3.5 Large

Generate images with SD3.5

liked a Space 6 months ago

Latent Navigation

liked a model 6 months ago

mattshumer/Reflection-Llama-3.1-70B

Text Generation • Updated Sep 24, 2024 • 676 • 1.72k

liked a Space 6 months ago

Yi Coder 9B

State-of-the-art coding performance with fewer than 10B parameters

liked 2 Spaces 7 months ago

Qwen2-VL-72B

Engage in multi-modal conversations with images and videos

Kolors Virtual Try-On

Upload images to try on clothes virtually

liked a model 7 months ago

nisten/Biggie-SmoLlm-0.15B-Base

Text Generation • Updated Aug 7, 2024 • 1.11k • 234

liked 3 Spaces 7 months ago

FLUX LoRa the Explorer

Generate images based on prompts and LoRA models

FLUX.1 [merged]

Generate images from text descriptions

CogVideoX-2B

Text-to-Video

replied to TuringsSolutions's post 8 months ago

If you really think the issue is not charging money, you are in for a surprise...

liked a model 8 months ago

Goekdeniz-Guelmez/J.O.S.I.E.v4o

Any-to-Any • Updated Oct 29, 2024 • 24

liked a Space 9 months ago

Inference Playground

Engage in chat conversations

reacted to singh96aman's post with 🔥 9 months ago

Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges (2406.12624)

Can LLMs serve as reliable judges ⚖️?

We aim to identify the right metrics for evaluating judge LLMs and to understand their sensitivity to prompt guidelines, engineering, and specificity. With this paper, we want to caution ⚠️ against blindly using LLMs as proxies for human judgment.

Blog - https://huggingface.co/blog/singh96aman/judgingthejudges
Arxiv - https://arxiv.org/abs/2406.12624
Tweet - https://x.com/iamsingh96aman/status/1804148173008703509

@singh96aman @kartik727 @Srinik-1 @sankaranv @dieuwkehupkes

New activity in nerijs/pixel-art-xl 9 months ago

License

#7 opened over 1 year ago by

liked a Space 9 months ago

Instruction Synthesizer

Generate instruction-response pairs from text

liked a model 9 months ago

deepseek-ai/deepseek-vl-7b-chat

Image-Text-to-Text • Updated Mar 15, 2024 • 53.3k • 245

liked a Space 9 months ago

Omost

Generate images from text prompts using AI