3 31 165

Kristoffer Rolf Deinoff

gatepoet

AI & ML interests

None yet

Recent Activity

liked a model about 20 hours ago

sam-paech/Delirium-v1

liked a model about 24 hours ago

Sao10K/L3-8B-Stheno-v3.2

liked a dataset about 24 hours ago

Gryphe/Opus-WritingPrompts

View all activity

Organizations

None yet

gatepoet's activity

upvoted a paper 4 months ago

SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

Paper • 2407.15841 • Published Jul 22 • 39

upvoted 2 collections 4 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Sep 25 • 624

Nemotron 4 340B

Collection

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 22 days ago • 159

upvoted a paper 6 months ago

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23 • 37

upvoted 2 papers 7 months ago

ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

Paper • 2404.07738 • Published Apr 11 • 2

InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation

Paper • 2404.19427 • Published Apr 30 • 71

upvoted an article 7 months ago

Article

Fine Tuning a LLM Using Kubernetes with Intel® Xeon® Scalable Processors

•

Apr 24

• 5

upvoted 4 papers 7 months ago

MultiBooth: Towards Generating All Your Concepts in an Image from Text

Paper • 2404.14239 • Published Apr 22 • 8

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

Paper • 2404.13208 • Published Apr 19 • 38

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Paper • 2404.05726 • Published Apr 8 • 20

Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

Paper • 2404.08197 • Published Apr 12 • 27

upvoted 2 papers 8 months ago

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Paper • 2404.05892 • Published Apr 8 • 31

Larimar: Large Language Models with Episodic Memory Control

Paper • 2403.11901 • Published Mar 18 • 32

upvoted 6 papers 9 months ago

Synth^2: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings

Paper • 2403.07750 • Published Mar 12 • 21

upvoted a collection 9 months ago

MAGNeT

Collection

Masked Audio Generation using a Single Non-Autoregressive Transformer • 9 items • Updated Apr 4 • 40