Together

Team

company

Verified

https://together.ai

togethercompute

togethercomputer

Inference Provider

2,149,616 monthly requests

AI & ML interests

Foundation Models, Decentralized Computing, Open Source AI.

Articles

Fine-tune Any LLM from the Hugging Face Hub with Together AI

JunxiongWang

authored a paper 9 months ago

M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

Paper • 2504.10449 • Published Apr 14, 2025 • 15

mauriceweber

authored a paper about 1 year ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14, 2025 • 62

jason136

authored a paper about 1 year ago

METAGENE-1: Metagenomic Foundation Model for Pandemic Monitoring

Paper • 2501.02045 • Published Jan 3, 2025 • 22

kezhentogether

authored a paper about 1 year ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 56

mauriceweber

authored a paper about 1 year ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 56

JunxiongWang

authored a paper over 1 year ago

The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Paper • 2408.15237 • Published Aug 27, 2024 • 42

rhubarbwu

authored 5 papers over 1 year ago

NeuralArTS: Structuring Neural Architecture Search with Type Theory

Paper • 2110.08710 • Published Oct 17, 2021

Towards One Shot Search Space Poisoning in Neural Architecture Search

Paper • 2111.07138 • Published Nov 13, 2021

Poisoning the Search Space in Neural Architecture Search

Paper • 2106.14406 • Published Jun 28, 2021

SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound

Paper • 2406.06612 • Published Jun 6, 2024 • 16

Linguistic Collapse: Neural Collapse in (Large) Language Models

Paper • 2405.17767 • Published May 28, 2024

zhangce

authored a paper over 1 year ago

Mixture-of-Agents Enhances Large Language Model Capabilities

Paper • 2406.04692 • Published Jun 7, 2024 • 59

junlinw

authored a paper over 1 year ago

Mixture-of-Agents Enhances Large Language Model Capabilities

Paper • 2406.04692 • Published Jun 7, 2024 • 59

alpayariyak

authored a paper almost 2 years ago

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Paper • 2404.00399 • Published Mar 30, 2024 • 42

Zhongzhu

authored a paper almost 2 years ago

FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design

Paper • 2401.14112 • Published Jan 25, 2024 • 20

JunxiongWang

authored a paper almost 2 years ago

MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24, 2024 • 60

jonsaadfalcon

authored a paper over 2 years ago

PDFTriage: Question Answering over Long, Structured Documents

Paper • 2309.08872 • Published Sep 16, 2023 • 53

Zhongzhu

authored a paper over 2 years ago

DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales

Paper • 2308.01320 • Published Aug 2, 2023 • 45

JunxiongWang

authored a paper over 2 years ago

Pretraining Without Attention

Paper • 2212.10544 • Published Dec 20, 2022

zhangce

updated a model over 2 years ago

togethercomputer/RedPajama-INCITE-Chat-3B-v1

Text Generation • Updated May 9, 2023 • 1.12k • 152