Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
3
Kadir Erturk
KadirErturk
Follow
0 followers
·
7 following
http://www.kadirerturk.com
kadirerturk
erturkkadir
AI & ML interests
sensors and machine learning
Recent Activity
Reacted to
singhsidhukuldeep
's
post
with 🔥
7 days ago
Good folks at @nvidia and @Tsinghua_Uni have released LLAMA-MESH - A Revolutionary Approach to 3D Content Generation! This innovative framework enables the direct generation of 3D meshes from natural language prompts while maintaining strong language capabilities. Here is the Architecture & Implementation! >> Core Components Model Foundation - If you haven't guessed it yet, it's built on the LLaMA-3.1-8B-Instruct base model - Maintains original language capabilities while adding 3D generation - Context length is set to 8,000 tokens 3D Representation Strategy - Uses the OBJ file format for mesh representation - Quantizes vertex coordinates into 64 discrete bins per axis - Sorts vertices by z-y-x coordinates, from lowest to highest - Sorts faces by the lowest vertex indices for consistency Data Processing Pipeline - Filters meshes to a maximum of 500 faces for computational efficiency - Applies random rotations (0°, 90°, 180°, 270°) for data augmentation - Generates ~125k mesh variations from 31k base meshes - Uses Cap3D-generated captions for text descriptions >> Training Framework Dataset Composition - 40% Mesh Generation tasks - 20% Mesh Understanding tasks - 40% General Conversation (UltraChat dataset) - 8x training turns for generation, 4x for understanding Training Configuration - Deployed on 32 A100 GPUs (for Nvidia, this is literally in-house) - 21,000 training iterations - Global batch size: 128 - AdamW optimizer with a 1e-5 learning rate - 30-step warmup with cosine scheduling - Total training time: approximately 3 days (based on the paper) This research opens exciting possibilities for intuitive 3D content creation through natural language interaction. The future of digital design is conversational!
updated
a dataset
13 days ago
KadirErturk/jenny-tts-tags-6h-v1
liked
a model
about 2 months ago
Helsinki-NLP/opus-mt-tc-big-tr-en
View all activity
Organizations
None yet
models
1
KadirErturk/image_info
Image-Text-to-Text
•
Updated
Mar 14
•
11
•
1
datasets
1
KadirErturk/jenny-tts-tags-6h-v1
Viewer
•
Updated
13 days ago
•
4k
•
31