magentic-ui
Magentic-UI is an AI agent designed to automate complex web and coding tasks while keeping users in control. It allows users to monitor websites, execute code, and analyze files, revealing its plan before execution and requesting approval for sensitive actions. The agent integrates with Microsoft's Fara-7B model and supports features like 'Tell me When' for automated monitoring and file upload capabilities. Users can extend its functionality with MCP agents and benefit from easier installation through Docker containers. Magentic-UI provides a user-friendly interface and a command-line interface for interacting with the agent. It is particularly useful for automating repetitive workflows and tasks that require ongoing monitoring and human oversight.
Magentic-UI solves the problem of automating complex web tasks that require monitoring and human intervention, such as ordering pizza or analyzing Airbnb prices. Instead of manually performing these tasks, which can be time-consuming and error-prone, users can leverage Magentic-UI to automate the process while maintaining control and oversight.
<div align="center">
<img src="docs/img/magui-readme-logo.svg" alt="Magentic-UI Logo">
_Automate your web tasks while you stay in control_
[PyPI](https://pypi.python.org/pypi/magentic_ui)

[arXiv:2507.22358](https://arxiv.org/abs/2507.22358)
</div>
---
Magentic-UI is a **research prototype** of a human-centered AI agent that solves complex web and coding tasks that may require monitoring. Unlike black-box agents, the system reveals its plan before execution, lets you guide its actions, and requests approval for sensitive operations while browsing websites, executing code, and analyzing files.
*Check out the [demo section](#demos) for inspiration on what tasks you can accomplish.*
## ✨ What's New
Microsoft's latest agentic model, [Fara-7B](https://www.microsoft.com/en-us/research/blog/fara-7b-an-efficient-agentic-model-for-computer-use/), is now integrated in Magentic-UI. Read how to launch it in the <a href="#fara-7b">Fara-7B guide</a>.
- **"Tell me When"**: Automate monitoring tasks and repeatable workflows that require web or API access and span minutes to days. *Learn more [here](https://www.microsoft.com/en-us/research/blog/tell-me-when-building-agents-that-can-wait-monitor-and-act/).*
- **File Upload Support**: Upload any file through the UI for analysis or modification
- **MCP Agents**: Extend capabilities with your favorite MCP servers
- **Easier Installation**: We have uploaded our Docker containers to GHCR, so you no longer need to build any containers. Installation is now much quicker.
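
To make the "Tell me When" idea concrete, the following is a minimal sketch of a generic polling loop: check a condition repeatedly, wait between checks, and give up after a deadline. It is not Magentic-UI's implementation; the function name `wait_until` and all of its parameters are hypothetical and exist only for this illustration.

```python
import time
from typing import Callable


def wait_until(
    check: Callable[[], bool],
    interval_s: float,
    timeout_s: float,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> bool:
    """Poll `check` until it returns True or `timeout_s` seconds elapse.

    Hypothetical helper for illustration only (not part of Magentic-UI).
    Returns True if the condition was met, False on timeout.
    """
    deadline = clock() + timeout_s
    while True:
        if check():
            return True
        # Stop if the next wait would run past the deadline.
        if clock() + interval_s > deadline:
            return False
        sleep(interval_s)
```

In a real monitoring task, `check` would be whatever web or API lookup you care about (for example, "has the repository reached a given star count?"), and the interval would be minutes or hours rather than seconds.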
## 🚀 Quick Start
Here's how you can get started with Magentic-UI:
```bash
# 1. Setup environment
python3 -m venv .venv
source .venv/bin/activate
pip install magentic-ui --upgrade
# 2. Set your API key
export OPENAI_API_KEY="your-api-key-here"
# 3. Launch Magentic-UI
magentic-ui --port 8081
```
Then open <http://localhost:8081> in your browser to interact with Magentic-UI!
> **Prerequisites**: Requires Docker and Python 3.10+. Windows users should use WSL2. See [detailed installation](#installation) for more info.
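
If you want to verify the prerequisites before installing, a small preflight script along these lines may help. It is a sketch that uses only the Python standard library; the function names are hypothetical, and the Docker check only confirms that a `docker` executable is on your `PATH`, not that the Docker daemon is running.

```python
import shutil
import sys

MIN_PYTHON = (3, 10)  # minimum version stated in the prerequisites


def python_ok(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the given interpreter version meets the minimum."""
    return tuple(version_info[:2]) >= minimum


def docker_on_path(which=shutil.which):
    """Return True if a `docker` executable is found on PATH.

    This does not verify that the Docker daemon is running.
    """
    return which("docker") is not None


def preflight():
    """Return a list of problems; an empty list means both checks passed."""
    problems = []
    if not python_ok():
        problems.append(f"Python {MIN_PYTHON[0]}.{MIN_PYTHON[1]}+ is required")
    if not docker_on_path():
        problems.append("docker was not found on PATH")
    return problems
```

Calling `preflight()` returns an empty list when both checks pass; otherwise it returns one message per missing prerequisite.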
## Alternative Usage Options
**Without Docker** (limited functionality: no code execution):
```bash
magentic-ui --run-without-docker --port 8081
```
**Command Line Interface**:
```bash
magentic-cli --work-dir PATH/TO/STORE/DATA
```
**Custom LLM Clients**:
```bash
# Azure
pip install magentic-ui[azure]
# Ollama (local models)
pip install magentic-ui[ollama]
```
You can then pass a config file to the `magentic-ui` command (<a href="#model-client-configuration"> client config</a>) or change the model client inside the UI settings.
For further details on installation, please read the <a href="#installation">🛠️ Installation</a> section. For common installation issues and their solutions, please refer to the [troubleshooting document](TROUBLESHOOTING.md). For advanced usage instructions, run `magentic-ui --help`.
## Quick Navigation:
<p align="center">
<a href="#demos">🎬 Demos</a> |
<a href="#how-it-works">🟪 How it Works</a> |
<a href="#installation">🛠️ Installation</a> |
<a href="#troubleshooting">⚠️ Troubleshooting</a> |
<a href="#contributing">🤝 Contributing</a> |
<a href="#license">📄 License</a>
</p>
---
## Demos
<table>
<tr>
<td width="33%" align="center">
**🍕 Pizza Ordering**
*Web automation with human-in-the-loop*
<video src="https://github.com/user-attachments/assets/dc95cf5f-c4b4-4fe0-b708-158ff071e5a9" width="100%" style="max-height: 300px;">
</video>
</td>
<td width="33%" align="center">
**🏠 Airbnb Price Analysis**
*MCP agent integration*
<video src="https://github.com/user-attachments/assets/c19ed8c2-e06f-43b7-bee3-5e2ffc4c5e02" width="100%" style="max-height: 300px;">
</video>
</td>
<td width="33%" align="center">
**⭐ Star Monitoring**
*Long-running monitoring task*
<video src="https://github.com/user-attachments/assets/d2a463ca-7a94-4414-932d-a69f30fff63b" width="100%" style="max-height: 300px;">
</video>
</td>
</tr>
</table>
## How it Works
<p align="center">
<img src="./docs/img/magenticui_running.png" alt="Magentic-UI" height="400">
</p>
Magentic-UI is especially useful for tasks that require actions on the web (e.g., filling a form, customizing a food order), deep navigation through websites not indexed by search engines (e.g., filtering flights, finding a link from a personal site), or a combination of web navigation and code execution (e.g., generating a chart from online data).
What differentiates Magentic-UI from other browser use offerings is its transparent and controllable interface that allows for efficient human-in-the-loop involvement. Magentic-UI is built using [AutoGen](https://github.com/microsoft/autogen) and provides a platform to study human-agent interaction and experiment with web agents. Key features include:
- 🧑‍🤝‍🧑 **Co-Planning**: Collaboratively create and approve step-by-step plans using chat and the plan editor.
- 🤝 **Co-Tasking**: Interrupt and guide the task execution using the web browser directly or through chat. Magentic-UI can also ask for clarifications and help when needed.
- 🛡️ **Action Guards**: Sensitive actions are only executed with explicit user approvals.
- 🧠 **Plan Learning and Retrieval**: Learn plans from previous runs to improve future task automation and save them in a plan gallery. Retrieve saved plans automatically or manually in future tasks.
- 🔀 **Parallel Task Execution**: You can run multiple tasks in parallel and session status indicators will let you know when Magentic-UI needs your input or has completed the task.
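
The Action Guards idea from the list above can be illustrated with a short conceptual sketch: an action marked as sensitive runs only after an approval callback says yes. This is not Magentic-UI's API. The `Action` class, the `sensitive` flag, and `run_with_guard` are hypothetical names chosen for this example, and how Magentic-UI actually classifies actions is not shown here.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Action:
    name: str
    sensitive: bool  # hypothetical flag, for illustration only


def run_with_guard(
    action: Action,
    execute: Callable[[Action], str],
    approve: Callable[[Action], bool],
) -> str:
    """Execute `action`, asking `approve` first when it is marked sensitive.

    Non-sensitive actions run without asking. A sensitive action that is
    not approved is skipped and never reaches `execute`.
    """
    if action.sensitive and not approve(action):
        return f"skipped: {action.name}"
    return execute(action)
```

In an interactive setting, `approve` would prompt the user (for example, through the chat UI) and return their decision, which keeps the human in the loop for operations such as submitting a payment.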
<div align="center">
<a href="https://www.youtube.com/watch?v=wOs-5SR8xOc" target="_blank">
<img src="https://img.youtube.com/vi/wOs-5SR8xOc/maxresdefault.jpg" alt="Watch the demo video" width="600"/>
</a>
<br>
▶️ <em> Click to watch a video and learn more about Magentic-UI </em>
</div>
### Autonomous Evaluation
To evaluate its autonomous capabilities, Magentic-UI has been tested against several benchmarks when running with o4-mini: the [GAIA](https://huggingface.co/datasets/gaia-benchmark/GAIA) test set (42.52%), which assesses general AI assistants across reasoning, tool use, and web interaction tasks; the [AssistantBench](https://huggingface.co/AssistantBench) test set (27.60%), which focuses on realistic, time-consuming web tasks; [WebVoyager](https://github.com/MinorJerry/WebVoyager) (82.2%), which measures end-to-end web navigation in real-world scenarios; and [WebGames](https://webgames.convergence.ai/) (45.5%), which evaluates general-purpose web-browsing agents through interactive challenges.
To reproduce these experimental results, please see the following [instructions](experiments/eval/README.md).
If you're interested in reading more, check out our [technical report](https://www.microsoft.com/en-us/research/wp-content/uploads/2025/07/magentic-ui-report.pdf) and [blog post](https://www.microsoft.com/en-us/research/blog/magentic-ui-an-experimental-human-centered-web-agent/).
## Installation
### Pre-Requisites
**Note**: If you're using Windows, we highly recommend using [WSL2](https://docs.microsoft.com/en-us/windows/wsl/install) (Windows Subsystem for Linux).
1. If running on **Windows** or **Mac**, use [Docker Desktop](https://www.docker.com/products/docker-desktop/). If you are inside WSL2, you can instead install Docker directly inside WSL (see the [Docker in WSL2 guide](https://gist.github.com/dehsilvadeveloper/c3bdf0f4cdcc5c177e2fe9be671820c7)). If running on **Linux**, use [Docker Engine](https://docs.docker.com/engine/install/).
If using Docker Desktop, make sure it is set up to use WSL2:
- Go to Settings > Res
[truncated…]