umuthopeyildirim (Umut Hope YILDIRIM)

upvoted an article 3 months ago

Article

Decoding Strategies in Large Language Models

By

•

Oct 29, 2024

• 38

upvoted a paper 4 months ago

HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale

Paper • 2409.16299 • Published Sep 9, 2024 • 11

upvoted 3 papers 6 months ago

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31, 2024 • 76

GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression

Paper • 2407.12077 • Published Jul 16, 2024 • 55

Searching for Best Practices in Retrieval-Augmented Generation

Paper • 2407.01219 • Published Jul 1, 2024 • 11

upvoted a collection 6 months ago

Noisy OCR

Collection

4 items • Updated Jul 8, 2024 • 1

upvoted a paper 7 months ago

GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices

Paper • 2406.08451 • Published Jun 12, 2024 • 24

upvoted a collection 7 months ago

PEFT

Collection

200 items • Updated Jul 8, 2024 • 16

upvoted a collection 8 months ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 225

upvoted a collection 9 months ago

MoEs papers reading list

Collection

60 items • Updated Nov 4, 2024 • 137

upvoted a collection 10 months ago

DBRX

Collection

DBRX is a mixture-of-experts (MoE) large language model trained from scratch by Databricks. • 3 items • Updated Mar 27, 2024 • 92

upvoted a paper 10 months ago

Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 10

upvoted a collection 12 months ago

🔍 Interpretability & Analysis of LMs

Collection

Outstanding research in LM interpretability and evaluation, summarized • 95 items • Updated 3 days ago • 96

upvoted a paper 12 months ago

In-Context Language Learning: Architectures and Algorithms

Paper • 2401.12973 • Published Jan 23, 2024 • 4

upvoted a collection 12 months ago

Fin-RWKV-V1

Collection

Attention free financial expert modal - RWKV V4 • 6 items • Updated Feb 2, 2024 • 1

upvoted 4 papers about 1 year ago

StarCoder: may the source be with you!

Paper • 2305.06161 • Published May 9, 2023 • 30

MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models

Paper • 2310.11954 • Published Oct 18, 2023 • 25

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Paper • 2310.11511 • Published Oct 17, 2023 • 75

Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts

Paper • 2310.11784 • Published Oct 18, 2023 • 10

upvoted a paper over 1 year ago

Context-Aware Meta-Learning

Paper • 2310.10971 • Published Oct 17, 2023 • 16

Umut Hope YILDIRIM PRO

AI & ML interests

Organizations

umuthopeyildirim's activity

Decoding Strategies in Large Language Models

HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale

Gemma 2: Improving Open Language Models at a Practical Size

GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression

Searching for Best Practices in Retrieval-Augmented Generation

Noisy OCR

GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices

PEFT

Model Merging

MoEs papers reading list

DBRX

Matryoshka Representation Learning

🔍 Interpretability & Analysis of LMs

In-Context Language Learning: Architectures and Algorithms

Fin-RWKV-V1

StarCoder: may the source be with you!

MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts

Context-Aware Meta-Learning