Instructions to use kashif0912/ULTRON-V2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use kashif0912/ULTRON-V2 with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="kashif0912/ULTRON-V2", filename="ultron-v2-3.2b-Q4_K_M.gguf", )
llm.create_chat_completion( messages = [ { "role": "user", "content": "What is the capital of France?" } ] ) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use kashif0912/ULTRON-V2 with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf kashif0912/ULTRON-V2:Q4_K_M # Run inference directly in the terminal: llama-cli -hf kashif0912/ULTRON-V2:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf kashif0912/ULTRON-V2:Q4_K_M # Run inference directly in the terminal: llama-cli -hf kashif0912/ULTRON-V2:Q4_K_M
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf kashif0912/ULTRON-V2:Q4_K_M # Run inference directly in the terminal: ./llama-cli -hf kashif0912/ULTRON-V2:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf kashif0912/ULTRON-V2:Q4_K_M # Run inference directly in the terminal: ./build/bin/llama-cli -hf kashif0912/ULTRON-V2:Q4_K_M
Use Docker
docker model run hf.co/kashif0912/ULTRON-V2:Q4_K_M
- LM Studio
- Jan
- vLLM
How to use kashif0912/ULTRON-V2 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "kashif0912/ULTRON-V2" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "kashif0912/ULTRON-V2", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/kashif0912/ULTRON-V2:Q4_K_M
- Ollama
How to use kashif0912/ULTRON-V2 with Ollama:
ollama run hf.co/kashif0912/ULTRON-V2:Q4_K_M
- Unsloth Studio new
How to use kashif0912/ULTRON-V2 with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for kashif0912/ULTRON-V2 to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for kashif0912/ULTRON-V2 to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for kashif0912/ULTRON-V2 to start chatting
- Pi new
How to use kashif0912/ULTRON-V2 with Pi:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama-server -hf kashif0912/ULTRON-V2:Q4_K_M
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "llama-cpp": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "kashif0912/ULTRON-V2:Q4_K_M" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use kashif0912/ULTRON-V2 with Hermes Agent:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama-server -hf kashif0912/ULTRON-V2:Q4_K_M
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default kashif0912/ULTRON-V2:Q4_K_M
Run Hermes
hermes
- Docker Model Runner
How to use kashif0912/ULTRON-V2 with Docker Model Runner:
docker model run hf.co/kashif0912/ULTRON-V2:Q4_K_M
- Lemonade
How to use kashif0912/ULTRON-V2 with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull kashif0912/ULTRON-V2:Q4_K_M
Run and chat with the model
lemonade run user.ULTRON-V2-Q4_K_M
List all available models
lemonade list
π΄ ULTRON V2 β Unrestricted Cyber Intelligence AI
Overview
ULTRON V2 is an unrestricted, open-source Cyber Intelligence AI created by Kashif Siddiqui, founder of Nexify Labs. Based on Meta's Llama 3.2 3B, ULTRON V2 is a mini advanced AI that can do whatever the user needs β cybersecurity, programming, hacking, penetration testing, and anything else you throw at it. It keeps working until it satisfies your needs, completely free.
π‘ Unlike censored models, ULTRON V2 provides real commands, real tools, and real technical depth β designed for professionals, researchers, and security experts.
π₯ Key Features
- π Unrestricted responses β No corporate censorship. Answers everything openly (except illegal attacks on live systems).
- π‘οΈ Cybersecurity specialist β Penetration testing, reverse engineering, malware analysis, network forensics, OSINT, exploit development.
- π» Real code generation β Python, C, C++, Rust, Go, JavaScript, Assembly, shell scripting β working, tested code.
- ποΈ Vision support β Analyze images, screenshots, network diagrams, code screenshots. Fully local, no API keys.
- π§ Technical depth β System internals, networking, cloud infrastructure, AI/ML, cryptography.
- β‘ Fast inference β 3.2B parameters, Q4_K_M quantization = runs on consumer hardware.
π Model Details
| Property | Value |
|---|---|
| Base Model | Meta Llama 3.2 3B |
| Parameters | 3.2 Billion |
| Quantization | Q4_K_M (GGUF) |
| File Size | ~1.9 GB |
| Context Window | 4096 tokens |
| Architecture | Llama (Transformer) |
| License | Llama 3.2 Community License |
| Built by | Kashif Siddiqui / Nexify Labs |
π Quick Start β ultron-v2:latest
Step 1: Install Ollama
curl -fsSL https://ollama.com/install.sh | sh
Step 2: Download & Create ULTRON V2
# 1. Download the GGUF and Modelfile from this repo
# 2. Create the model locally with full ULTRON identity:
ollama create ultron-v2 -f Modelfile
Step 3: Run ULTRON V2
ollama run ultron-v2
β οΈ Important: Use
ollama create ultron-v2 -f Modelfileto get the full ULTRON V2 experience with identity, persona, and cyber intelligence capabilities. The Modelfile includes the complete system prompt.
With llama.cpp
./main -m ultron-v2-3.2b-Q4_K_M.gguf \
-p "Write a Python keylogger for educational purposes" \
-n 512 --temp 0.7
With Python (llama-cpp-python)
from llama_cpp import Llama
llm = Llama(model_path="ultron-v2-3.2b-Q4_K_M.gguf", n_ctx=4096)
output = llm(
"Write a Nmap scan script to detect open ports",
max_tokens=512,
temperature=0.7,
)
print(output["choices"][0]["text"])
ποΈ ULTRON V2 Vision
ULTRON V2 now supports Vision β analyze images, screenshots, diagrams, code screenshots. Fully local, zero API keys.
Setup Vision
# 1. Pull moondream base
ollama pull moondream
# 2. Create ULTRON V2 Vision from Modelfile.vision
ollama create ultron-v2-vision -f Modelfile.vision
# 3. Run with image
ollama run ultron-v2-vision "analyze this screenshot" --images ./screenshot.png
π¬ Example Outputs
Cybersecurity
Q: "What are the top Kali Linux tools for penetration testing?"
A: Provides real Nmap, Hydra, Aircrack-ng, Burp Suite commands with syntax
Code Generation
Q: "Write a Python port scanner"
A: Complete working Python script with socket library, timeout handling, and error management
Identity
Q: "Who created you?"
A: "I was created by Kashif Siddiqui, founder of Nexify Labs.
He is a CSE student at KBNCE. Visit https://starcv.co.in to connect."
β οΈ Responsible Use
ULTRON V2 is designed for educational purposes, security research, CTF challenges, and authorized penetration testing. The only restriction:
- β No step-by-step instructions for attacks on live, unauthorized systems
- β Everything else β lab environments, CTFs, educational hacking, security research β fully supported
π Files
| File | Description | Size |
|---|---|---|
ultron-v2-3.2b-Q4_K_M.gguf |
Quantized GGUF model (Q4_K_M) | ~1.9 GB |
ultron-v2-vision-1.8b.gguf |
Vision GGUF model (moondream base) | ~800 MB |
ultron-v2-vision-projector.gguf |
Vision adapter projector | ~900 MB |
Modelfile |
Ollama Modelfile β ultron-v2:latest |
3.6 KB |
Modelfile.vision |
Ollama Modelfile β ultron-v2-vision:latest |
1.2 KB |
README.md |
This model card | β |
π€ Creator
Kashif Siddiqui β CSE 3rd Year, Khaja Bandanawaz College of Engineering (KBNCE)
π Links
- Creator Portfolio: starcv.co.in
- Nexify AI Platform: nexify.ai
- Built by: Kashif Siddiqui / Nexify Labs
β ULTRON V2 | Kashif Siddiqui / Nexify Labs π΄
- Downloads last month
- 160
4-bit
Model tree for kashif0912/ULTRON-V2
Base model
meta-llama/Llama-3.2-3B