Michael's picture

Michael

michaelfeil

·

https://michaelfeil.eu

michaelfeil

AI & ML interests

ML Inference

Recent Activity

updated a model about 9 hours ago

BAAI/bge-en-icl

New activity about 9 hours ago

BAAI/bge-en-icl:Infinity usage

New activity about 9 hours ago

nvidia/NV-Embed-v2:How to run infinity and nv-embed-2

View all activity

Articles

Accelerating Embedding & Reranking Models on AMD Using Infinity

Organizations

michaelfeil's activity

upvoted a paper 3 months ago

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Paper • 2409.10516 • Published Sep 16 • 39

upvoted an article 5 months ago

Article

Mixedbread 🤝 deepset: Announcing our New German/English Embedding Model

By

•

Jul 19

• 15

upvoted a paper 6 months ago

MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention

Paper • 2407.02490 • Published Jul 2 • 23

upvoted a collection 8 months ago

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12 • 34

upvoted a paper over 1 year ago

Textbooks Are All You Need

Paper • 2306.11644 • Published Jun 20, 2023 • 142