VLArm

Goal: create a more 'human' robotic arm that can be used in industrial settings, powered by Vision-Language-Action models.

The twist: this project takes an existing robot (HuggingFace's SO-101) and gives it the ability to make decisions 'for itself' using a locally running MCP server and an LLM from Ollama. In short, the robot acts on natural-language commands interpreted by a locally running Large Language Model.

Future steps: once the language commands are working, I will add vision and listening capabilities so the model can hear spoken commands and see what it is doing in real time.

How I'll build it

Building this project will cost approximately $150 in parts.

Structure

This repository contains the following files and directories:

  • BOM.csv: Parts needed to build the arm.
  • 3D: 3D models and Ultimaker Cura files for 3D printing the casing for the arm.
  • mcp: A folder with the files for setting up an MCP server that can be used to control the arm via an LLM.

Setting up MCP for Ollama

The mcp folder in the main directory contains the files for running a server that speaks MCP, Anthropic's open-source Model Context Protocol. MCP can be used with Ollama through ollmcp, an open-source MCP client for Ollama. The files in this folder can be configured with ollmcp so that a locally running LLM served by Ollama makes tool calls to move the arm.
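
For reference, a server of this kind boils down to a handful of tool definitions. Below is a minimal sketch, assuming the official mcp Python SDK (pip install mcp); the move_joint tool, joint names, and angle limits are hypothetical illustrations, and the repository's actual server.py may differ:

```python
# Sketch only -- not the repository's actual server.py.
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("so101-arm")

# Hypothetical joint set and per-joint angle limits, for illustration.
JOINT_LIMITS = {"shoulder": 90.0, "elbow": 120.0, "wrist": 90.0, "gripper": 45.0}

@mcp.tool()
def move_joint(joint: str, degrees: float) -> str:
    """Rotate a named joint to an absolute angle in degrees."""
    limit = JOINT_LIMITS.get(joint)
    if limit is None:
        return f"Unknown joint: {joint}. Valid joints: {sorted(JOINT_LIMITS)}"
    if abs(degrees) > limit:
        return f"{joint} is limited to +/-{limit} degrees"
    # A real server would drive the hardware here, e.g. through the
    # Phosphobot API or the LeRobot library.
    return f"Moved {joint} to {degrees:.1f} degrees"

if __name__ == "__main__":
    mcp.run()  # stdio transport by default, which is what ollmcp speaks
```

ollmcp launches a script like this as a subprocess, advertises its tools to the model, and relays the LLM's tool calls back to it over stdio.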

Instructions

Set up the prerequisites, then:

  1. Run pip install ollmcp.
  2. Assuming you've cloned this repository, change into the mcp directory and install the dependencies from requirements.txt (for example, with pip install -r requirements.txt).
  3. Run ollmcp --mcp-server server.py --model qwen2.5.
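
To sanity-check the server before wiring it up to Ollama, you can list its tools directly with the MCP Python client. This is again a sketch assuming the same mcp SDK, run from the mcp directory next to server.py:

```python
import asyncio

from mcp import ClientSession, StdioServerParameters
from mcp.client.stdio import stdio_client

async def main() -> None:
    # Launch server.py as a subprocess and talk to it over stdio.
    params = StdioServerParameters(command="python", args=["server.py"])
    async with stdio_client(params) as (read, write):
        async with ClientSession(read, write) as session:
            await session.initialize()
            tools = await session.list_tools()
            print("Tools exposed by the server:", [t.name for t in tools.tools])

asyncio.run(main())
```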

Credits

This project is based on the HuggingFace/TheRobotStudio SO-100 tutorial, the Phosphobot API, and the LeRobot library.

License

The SO-100 Arm 3D models included in this repository are licensed under the Apache license: https://github.com/TheRobotStudio/SO-ARM100/blob/main/LICENSE

Any material that is not sourced from that repository is licensed under the MIT License.
