19 3 85

Shuyue Jia (Bruce)

shuyuej

https://shuyuej.com

SuperBruceJia

AI & ML interests

A Ph.D. Student at @vkola-lab, Boston University. Passionate about Large Language Models (LLMs), Multimodal Foundation Models, Generative AI, and Medical AI.

Recent Activity

updated a model about 20 hours ago

shuyuej/Llama-3.1-70B-Instruct-2048

liked a model 3 days ago

shuyuej/SFR-Embedding-2_R-GPTQ

updated a model 3 days ago

shuyuej/SFR-Embedding-2_R-GPTQ

View all activity

Organizations

shuyuej's activity

New activity in shuyuej/e5-mistral-7b-instruct-GPTQ 3 months ago

missing model.safetensors.index.json

#1 opened 4 months ago by

kresimirfijacko

New activity in shuyuej/Mistral-Nemo-Instruct-2407-GPTQ 4 months ago

Can you create gptq 8 bits quants?

#1 opened 4 months ago by

rjmehta

New activity in hugging-quants/Meta-Llama-3.1-405B-Instruct-GPTQ-INT4 4 months ago

Can you provide one model using `group_size=1024` to make the model smaller?

#15 opened 4 months ago by

shuyuej

OOM Error

#13 opened 4 months ago by

shuyuej

Update quantize_config.json

#12 opened 4 months ago by

shuyuej

Update config.json

#11 opened 4 months ago by

shuyuej

Source codes to quantize the LLaMA 3.1 405B model

#10 opened 4 months ago by

shuyuej

New activity in hugging-quants/Meta-Llama-3.1-70B-Instruct-GPTQ-INT4 4 months ago

Request for Mistral Large Instruct GPTQ INT4

#2 opened 4 months ago by

sparsh35

New activity in mistralai/Mamba-Codestral-7B-v0.1 4 months ago

Missing config.json

#6 opened 4 months ago by

wxl2001

New activity in openerotica/c4ai-command-r-plus-GPTQ-ERQ 4 months ago

Where can we download `quant.py`?

#1 opened 4 months ago by

shuyuej

New activity in CohereForAI/c4ai-command-r-v01 4 months ago

Learning Rate during pretraining

#58 opened 4 months ago by

shuyuej

New activity in Salesforce/SFR-Embedding-2_R 4 months ago

About the tokenizer - Why use LLaMA tokenizer?

#5 opened 4 months ago by

shuyuej

New activity in dunzhang/stella_en_1.5B_v5 4 months ago

Model max_seq_length

#6 opened 4 months ago by

shuyuej

New activity in Salesforce/SFR-Embedding-2_R 4 months ago

Model max_seq_length

#4 opened 4 months ago by

shuyuej

New activity in openlifescienceai/open_medical_llm_leaderboard 6 months ago

Where can we find `eval_medical_llm.py` and `main.py`

#15 opened 6 months ago by

shuyuej

New activity in google/gemma-7b 6 months ago

Fine-Tune a gemma model for question answering

#62 opened 9 months ago by

Iamexperimenting

New activity in google/gemma-7b 7 months ago

Weird Performance Issue with Gemma-7b compared to Gemma-2b with Qlora

#91 opened 7 months ago by

UserDAN

New activity in mistralai/Mixtral-8x7B-Instruct-v0.1 8 months ago

What is the actual context size of mistralai/Mixtral-8x7B-Instruct-v0.1 model

#186 opened 8 months ago by

Pradeep1995

New activity in google/gemma-7b 8 months ago

Very different results with float16. [Actually, gemma-7b-it does not work with float16]

#33 opened 9 months ago by

EarthWorm001