S M Jishanul Islam

smji

https://s-m-j-i.github.io/Personal-CV/

S-M-J-I

AI & ML interests

Computer Vision, NLP, LLMs, Multimodal Deep Learning

Recent Activity

liked a model about 1 month ago

deepseek-ai/deepseek-math-7b-instruct

smji's activity

upvoted an article 4 months ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11 • 104

upvoted a paper 8 months ago

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Paper • 2404.05726 • Published Apr 8 • 20

upvoted a collection 8 months ago

PDF Document / OCR Datasets

Document datasets with .pdf files that are usable with pixparse libraries and tools. • 2 items • Updated Mar 30 • 47

upvoted 2 papers 8 months ago

VideoPrism: A Foundational Visual Encoder for Video Understanding

Paper • 2402.13217 • Published Feb 20 • 22

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14 • 124

upvoted a collection 8 months ago

Bengali Regional Text to IPA Models

A collection of models for transcribing Bengali regional text into the International Phonetic Alphabet (IPA). • 3 items • Updated Apr 6 • 1