You're Genius!
Di Zhang
qq8933
AI & ML interests
AI4Chem, LLM, Green LLM
Recent Activity
updated
a dataset
about 3 hours ago
qq8933/OpenLongCoT-Pretrain-v2
updated
a dataset
about 23 hours ago
qq8933/llama_o1_offline_training_data_v1
New activity
2 days ago
qq8933/OpenLongCoT-Pretrain
Organizations
qq8933's activity
replied to
their
post
7 days ago
replied to
their
post
16 days ago
main.py
is the entry for finetune, but codes need further improvements, see 'Call for contributors'
posted
an
update
17 days ago
Post
2269
Discovered an outrageous bug on the ChatGPT official website, especially for those using ad-blocking plugins. Please make sure to add
For users with Chinese IP addresses, consider adding this URL to the rules of your U.S. node, as the response headers from this site will report the user's physical location to GPT.
browser-intake-datadoghq.com
to your ad block whitelist. The ChatGPT webpage relies on this site for heartbeat detection, but since it belongs to an ad tracking network, it's included in major ad-blocking lists. (If you're using Clash, also remember to add it to the whitelist.) Failing to do so may cause the ChatGPT web interface to display a greyed-out send button after clicking, with no response.For users with Chinese IP addresses, consider adding this URL to the rules of your U.S. node, as the response headers from this site will report the user's physical location to GPT.
posted
an
update
19 days ago
Post
5513
LLaMA-O1: Open Large Reasoning Model Frameworks For Training, Inference and Evaluation With PyTorch and HuggingFace
Large Reasoning Models powered by Monte Carlo Tree Search (MCTS), Self-Play Reinforcement Learning, PPO, AlphaGo Zero's dua policy paradigm and Large Language Models!
https://github.com/SimpleBerry/LLaMA-O1/
What will happen when you compound MCTS โค LLM โค Self-Play โคRLHF?
Just a little bite of strawberry!๐
Past related works:
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning (2410.02884)
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B (2406.07394)
Large Reasoning Models powered by Monte Carlo Tree Search (MCTS), Self-Play Reinforcement Learning, PPO, AlphaGo Zero's dua policy paradigm and Large Language Models!
https://github.com/SimpleBerry/LLaMA-O1/
What will happen when you compound MCTS โค LLM โค Self-Play โคRLHF?
Just a little bite of strawberry!๐
Past related works:
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning (2410.02884)
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B (2406.07394)
posted
an
update
3 months ago
Post
1533
๐ Introducing ChemVLM, the first open-source multimodal large language model dedicated to chemistry!
๐Comparable performances with commercial models or specific OCR model but with dialogue capabilities!
โจ2B/26B Models Here! AI4Chem/ChemVLM-26B
Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM (2408.07246)
๐Comparable performances with commercial models or specific OCR model but with dialogue capabilities!
โจ2B/26B Models Here! AI4Chem/ChemVLM-26B
Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM (2408.07246)
replied to
their
post
4 months ago
ๅ็ซฏๅผๅธธ๏ผๆๆไบ๏ผๅจไฟฎๅค
posted
an
update
5 months ago
Post
653
Preview:
We will open source the 2.5B ChemVLM and the tool-enhanced ChemLLM-7B in the near future
We will open source the 2.5B ChemVLM and the tool-enhanced ChemLLM-7B in the near future
posted
an
update
5 months ago
Post
736
A great work based on ChemLLM from Open-source community!
Automatic Scientific Discovery guided by LLM!
https://github.com/zyzisastudyreallyhardguy/LLM4SD
Automatic Scientific Discovery guided by LLM!
https://github.com/zyzisastudyreallyhardguy/LLM4SD
posted
an
update
5 months ago
Post
996
New Appearance from Ollama Open WebUI!
And Also web search, Realtime talking and File RAG!
https://chemllm.org/
And Also web search, Realtime talking and File RAG!
https://chemllm.org/
posted
an
update
6 months ago
Post
2002
The First Multimodal Language Model dedicated for Chemistry.
Demo: https://v.chemllm.org/
Finetune based on ChemLLM-20B and InterViT-6B on MMChemExam and ChemOCR Datasets (coming soon...)
AI4Chem/ChemVLM-26B
ChemLLM: A Chemical Large Language Model (2402.06852)
Demo: https://v.chemllm.org/
Finetune based on ChemLLM-20B and InterViT-6B on MMChemExam and ChemOCR Datasets (coming soon...)
AI4Chem/ChemVLM-26B
ChemLLM: A Chemical Large Language Model (2402.06852)
replied to
their
post
6 months ago
We forked it from InternVL Repo
Post
2059
Hello, Vision World!
AI4Chem/ChemLLM-20B-Chat-DPO
ChemLLM: A Chemical Large Language Model (2402.06852)
AI4Chem/ChemLLM-20B-Chat-DPO
ChemLLM: A Chemical Large Language Model (2402.06852)
posted
an
update
6 months ago
Post
2059
Hello, Vision World!
AI4Chem/ChemLLM-20B-Chat-DPO
ChemLLM: A Chemical Large Language Model (2402.06852)
AI4Chem/ChemLLM-20B-Chat-DPO
ChemLLM: A Chemical Large Language Model (2402.06852)
posted
an
update
7 months ago
Post
2015
Chemllm.org Now transfered to ChemLLM-20B-DPO, Have a try now!๐ค
replied to
their
post
7 months ago
Sorry for network issues, still uploading...