Rajiv Shah

rajistics

https://www.rajivshah.com

Recent Activity

updated a dataset 3 days ago

rajistics/openhands-synthetic-conversations

Updated 3 days ago • 2.01k

published a dataset 3 days ago

rajistics/openhands-synthetic-conversations

Updated 3 days ago • 2.01k

liked a Space 2 months ago

OpenHands/openhands-index

OpenHands Index

🤖

A Holistic Benchmark for Software Engineering

liked a model 5 months ago

infly/inf-query-aligner

Reinforcement Learning • 8B • Updated Jan 5 • 170 • 8

reacted to Kseniase's post with ❤️ 6 months ago

9 Recent advances in Multi-Agent Systems (all open-source)

The idea of splitting tasks across multiple agents, instead of relying on one universal agent, is now seen as one of the most effective ways to build an AI stack. Concepts like “agent swarms” were highlighted at the AI Engineer Code Summit in NYC (Nov 20–21) as the winning architecture. And this trend is not only about coding and software: it applies across all AI domains.

So here is some recent research that improves multi-agent systems (MAS) and keeps them up to date:

1. LatentMAS → Latent Collaboration in Multi-Agent Systems (2511.20639)
AI agents share their hidden "thoughts" directly in latent space instead of talking through text. This makes collaboration and reasoning way faster and more accurate (no extra training needed)

2. Puppeteer → Multi-Agent Collaboration via Evolving Orchestration (2505.19591)
Uses a “puppeteer” LLM that dynamically decides which agents (“puppets”) to call and in what order. By learning this orchestration with reinforcement learning (RL), the system solves complex tasks more efficiently and at lower compute cost

3. MADD → MADD: Multi-Agent Drug Discovery Orchestra (2511.08217)
A MAS with 4 agents for drug discovery. It lets researchers describe a drug discovery task in plain language. Then MADD automatically builds and runs the full hit-identification pipeline, making AI-driven drug design a simple end-to-end workflow

4. MATPO → Multi-Agent Tool-Integrated Policy Optimization (2510.04678)
Lets one LLM act as multiple agents (like a planner and a worker) by using different prompts and training them together with RL. So you get the benefits of a multi-agent system without needing multiple models
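
To make the orchestration idea from item 2 more concrete, here is a minimal Python sketch of a "puppeteer" loop that decides which agent to call next based on the current state. Everything in it is an assumption for illustration: the agent functions (`plan`, `solve`, `review`), the `puppeteer` policy, and the `run` helper are hypothetical stand-ins, the agents are plain functions rather than LLM calls, and the policy is hand-written rather than learned with RL as in the paper.

```python
from typing import Callable, Dict, List, Optional, Tuple

State = Dict[str, str]

# Hypothetical "puppet" agents: plain functions standing in for LLM calls.
def plan(state: State) -> State:
    return {**state, "plan": f"steps for: {state['task']}"}

def solve(state: State) -> State:
    return {**state, "draft": f"draft based on {state['plan']}"}

def review(state: State) -> State:
    return {**state, "final": f"reviewed: {state['draft']}"}

AGENTS: Dict[str, Callable[[State], State]] = {
    "planner": plan,
    "solver": solve,
    "reviewer": review,
}

def puppeteer(state: State) -> Optional[str]:
    """Toy hand-written policy: pick the next agent from the current state.

    In the paper this decision is learned with RL; here it is a fixed rule
    so the control flow is easy to follow.
    """
    if "plan" not in state:
        return "planner"
    if "draft" not in state:
        return "solver"
    if "final" not in state:
        return "reviewer"
    return None  # nothing left to do, so stop

def run(task: str, max_steps: int = 10) -> Tuple[State, List[str]]:
    """Call agents one at a time until the puppeteer signals completion."""
    state: State = {"task": task}
    trace: List[str] = []
    for _ in range(max_steps):
        name = puppeteer(state)
        if name is None:
            break
        state = AGENTS[name](state)
        trace.append(name)
    return state, trace

if __name__ == "__main__":
    final_state, calls = run("sort a list")
    print(calls)
    print(final_state["final"])
```

The point of the sketch is only the shape of the loop: a single controller inspects the shared state, picks one agent, and repeats, so the call order is decided at run time instead of being fixed in advance.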

If you're interested in multi-agent trends for the future of software development, explore my article on the emerging playbook. This is super interesting → https://www.turingpost.com/p/aisoftwarestack
Also, subscribe to the Turing Post: https://www.turingpost.com/subscribe

Read further below ⬇️