osllm.ai Models Highlights Program
We believe there's no need to pay a token if you have a GPU on your computer.
Highlighting new and noteworthy models from the community. Join the conversation on Discord.
Model creator: ibm-granite
Official Website • Documentation • Discord
NEW: Subscribe to our mailing list for updates and news!
Email: support@osllm.ai
Acknowledgments
Our sincere gratitude to the Meta and Llama teams for their efforts in developing and releasing these models.
Model Overview
The Meta Llama 3.2 collection features multilingual large language models (LLMs), available in 1B and 3B sizes, with capabilities in both text input and output. The instruction-tuned Llama 3.2 models are optimized for multilingual dialogue, excelling in agentic retrieval and summarization tasks. They demonstrate superior performance on standard industry benchmarks compared to many open-source and closed chat models.
- Developer: Meta
- Architecture: Llama 3.2 is an auto-regressive language model utilizing an optimized transformer structure. Its tuned versions employ supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align with human preferences for helpfulness and safety.
- Supported Languages: Officially supported languages include English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai. Llama 3.2 has been trained on a wider array of languages, and developers may further fine-tune the model for additional languages, subject to the Llama 3.2 Community License and Acceptable Use Policy. Responsible and safe deployment practices are required.
- Token Counts: Token references pertain solely to pretraining data. All versions employ Grouped-Query Attention (GQA) to enhance inference scalability.
Release Information
- Release Date: September 25, 2024
- Status: This is a static model based on an offline dataset. Future updates may further enhance model performance and safety.
- License: Llama 3.2 usage is governed by the Llama 3.2 Community License, a custom commercial license agreement.
Feedback and Further Information
For questions or feedback regarding Llama 3.2, please refer to the model README. Additional technical details and guidance on generation parameters, as well as usage recipes, can be found here.
Disclaimers
osllm.ai is not the creator, originator, or owner of any Model featured in the Community Model Program. Each Community Model is created and provided by third parties. osllm.ai does not endorse, support, represent, or guarantee the completeness, truthfulness, accuracy, or reliability of any Community Model. You understand that Community Models can produce content that might be offensive, harmful, inaccurate, or otherwise inappropriate, or deceptive. Each Community Model is the sole responsibility of the person or entity who originated such Model. osllm.ai may not monitor or control the Community Models and cannot, and does not, take responsibility for any such Model. osllm.ai disclaims all warranties or guarantees about the accuracy, reliability, or benefits of the Community Models. osllm.ai further disclaims any warranty that the Community Model will meet your requirements, be secure, uninterrupted, or available at any time or location, or error-free, virus-free, or that any errors will be corrected, or otherwise. You will be solely responsible for any damage resulting from your use of or access to the Community Models, your downloading of any Community Model, or use of any other Community Model provided by or through osllm.ai.
- Downloads last month
- 13
Model tree for osllmai-community/Qwen2.5-Coder-7B-Instruct-bnb-4bit
Base model
meta-llama/Llama-3.2-1B-Instruct