NVIDIA Papers - a jxtngx Collection

NVIDIA Papers

updated 8 days ago

Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism

Paper • 1909.08053 • Published Sep 17, 2019 • 2

NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models

Paper • 2405.17428 • Published May 27 • 17