The training datasets used for training the ChEmbed family of text embedding models
AI & ML interests
None defined yet.
Recent Activity
View all activity
Organization Card
Edit this README.md markdown file to author your organization card.
datasets
76
BASF-AI/uspto-title-abs-chem
Viewer
•
Updated
•
75.8k
•
5
BASF-AI/uspto-synth-query-abs-chem
Viewer
•
Updated
•
75.8k
•
8
BASF-AI/PlantCAD2_virtual_hackathon
Viewer
•
Updated
•
9
•
62
BASF-AI/dolma-pes2o-chemistry
Viewer
•
Updated
•
361k
•
122
•
1
BASF-AI/ChemRxiv-Papers
Viewer
•
Updated
•
30.4k
•
31
•
1
BASF-AI/ChemRxiv-Paragraphs
Viewer
•
Updated
•
209k
•
40
•
2
BASF-AI/ChemRxiv-Train-CC-BY
Viewer
•
Updated
•
139k
•
36
BASF-AI/dolma-chem-only-query-generated
Viewer
•
Updated
•
1.17M
•
56
BASF-AI/ChemRxivRetrieval
Viewer
•
Updated
•
79.5k
•
27
•
1
BASF-AI/ChemRxiv-Train-CC-BY-v2
Viewer
•
Updated
•
138k
•
19
•
2