This project presents a fine-tuned version of Microsoft's Phi-3.5 model, optimized for enhanced conversational abilities and general knowledge tasks.
Model Details
- Base model: microsoft/Phi-3.5-mini-instruct
- Fine-tuning method: PEFT (Parameter-Efficient Fine-Tuning)
- Training data: [Brief description of your dataset]
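For readers who want to set up a similar fine-tune, here is a minimal sketch of a LoRA-style PEFT configuration on the base model. The rank, alpha, and target module names are illustrative assumptions, not the exact recipe used for this model.

```python
# Minimal LoRA/PEFT sketch for the base model; rank, alpha, and target
# modules are illustrative assumptions, not the exact recipe used here.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base_id = "microsoft/Phi-3.5-mini-instruct"
tokenizer = AutoTokenizer.from_pretrained(base_id)  # used later to tokenize the training data
model = AutoModelForCausalLM.from_pretrained(base_id)

lora_config = LoraConfig(
    r=16,                                   # rank of the low-rank update matrices
    lora_alpha=32,                          # scaling factor for the adapters
    target_modules=["qkv_proj", "o_proj"],  # assumed attention projections in Phi-3.5
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```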
Features
- Improved response generation for a wide range of topics
- Enhanced context understanding and coherence
- Optimized for deployment on Hugging Face Spaces
Usage
This model can be used for various natural language processing tasks, including:
- General conversation
- Question answering
- Task instructions
- Creative writing
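As a quick start, the snippet below shows one way to load the model for generation with the transformers pipeline. The repository id is a placeholder for wherever the fine-tuned weights are published; for chat-style use, applying the model's chat template will give better results.

```python
# Minimal usage sketch; "your-username/phi-3.5-finetuned" is a placeholder
# repo id for wherever the fine-tuned weights are published.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="your-username/phi-3.5-finetuned",
    device_map="auto",
)

prompt = "Explain parameter-efficient fine-tuning in two sentences."
result = generator(prompt, max_new_tokens=128, do_sample=False)
print(result[0]["generated_text"])
```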
Limitations While this fine-tuned model shows improved performance, users should be aware of potential biases and limitations inherent in language models. Always critically evaluate the model's outputs.
Feedback I welcome any feedback, suggestions, or questions about this project. Feel free to open an issue or contribute to further improvements!
Excited to share my new Gradio app featuring the impressive Llama-3.1-Storm-8B model! This app demonstrates the capabilities of Llama-3.1-Storm-8B, an 8B-parameter language model created by Ashvini Kumar Jindal, Pawan Kumar Rajpoot, and Ankur Parikh (@akjindal53244).
Key highlights of Llama-3.1-Storm-8B:
Outperforms Llama-3.1-8B-Instruct on multiple benchmarks:
- Instruction Following (IFEval): +3.93%
- Knowledge-driven QA (GPQA): +7.21%
- Reduced Hallucinations (TruthfulQA): +9%
- Function Calling (BFCL): +7.92%
- Achieves impressive results with only 8B parameters
- Uses innovative techniques like self-curation and model merging
Kudos to the creators for pushing the boundaries of smaller language models! This work makes advanced AI more accessible and efficient. #AI #NLP #MachineLearning #GradioApp #Llama3
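For anyone curious how such a demo is wired up, a bare-bones Gradio chat interface might look roughly like the sketch below. The generation settings are my own assumptions, and the actual Space may differ (quantization, streaming, a system prompt, etc.).

```python
# Bare-bones Gradio chat demo around Llama-3.1-Storm-8B; generation
# settings here are illustrative, not the actual Space's configuration.
import gradio as gr
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="akjindal53244/Llama-3.1-Storm-8B",
    device_map="auto",
)

def respond(message, history):
    # This sketch ignores the chat history; a real app would fold it into
    # the prompt via the model's chat template.
    output = generator(message, max_new_tokens=256, do_sample=True, temperature=0.7)
    # The pipeline returns the prompt plus the completion; strip the prompt.
    return output[0]["generated_text"][len(message):]

gr.ChatInterface(respond).launch()
```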
This model essentially explores having different experts (MoE) for the image-encoder part of a vision-language model. How? 🧐 The authors concatenate the vision encoders' output tokens together and apply "pre-alignment": essentially fine-tuning the experts with a frozen text encoder.
Then they freeze both the experts and the decoder and train only the projection layer; finally, they unfreeze everything for supervised fine-tuning ✨
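A toy illustration of that staged freezing schedule, with generic torch modules standing in for the experts, the projection layer, and the decoder (names and sizes are not from the paper's code):

```python
# Toy illustration of the staged freezing schedule; module names and sizes
# are stand-ins, not the paper's actual code.
import torch.nn as nn

def set_trainable(module: nn.Module, trainable: bool) -> None:
    for p in module.parameters():
        p.requires_grad = trainable

experts = nn.ModuleList([nn.Linear(1024, 1024), nn.Linear(1024, 1024)])  # vision experts
projection = nn.Linear(2 * 1024, 4096)                       # visual tokens -> LLM dim
decoder = nn.TransformerDecoderLayer(d_model=4096, nhead=8)  # stand-in for the text decoder

# Stage: freeze the experts and the decoder, train only the projection layer.
set_trainable(experts, False)
set_trainable(decoder, False)
set_trainable(projection, True)

# Final stage: unfreeze everything for supervised fine-tuning.
for module in (experts, projection, decoder):
    set_trainable(module, True)
```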
In the paper, they explore different fusion strategies and vision encoders, extending the basic CLIP encoder, and find that simply concatenating the visual tokens works well. The rest of the architecture is quite similar to LLaVA.
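And a minimal, self-contained sketch of the concatenation-based fusion itself, with two linear layers standing in for the vision experts and purely illustrative dimensions:

```python
# Minimal sketch of concatenation-based fusion: two linear layers stand in
# for the vision experts; all dimensions are illustrative assumptions.
import torch
import torch.nn as nn

batch, n_tokens = 2, 576
expert_a = nn.Linear(768, 1024)        # stand-in for e.g. a CLIP-style encoder
expert_b = nn.Linear(768, 1024)        # stand-in for a second vision expert
projector = nn.Linear(2 * 1024, 4096)  # concatenated channels -> LLM hidden size

patches = torch.randn(batch, n_tokens, 768)  # toy patch features shared by both experts
tokens = torch.cat([expert_a(patches), expert_b(patches)], dim=-1)  # (B, T, 2048)
visual_embeds = projector(tokens)            # (B, T, 4096), fed to the language model
print(visual_embeds.shape)
```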