developer-toolsai-ml

UniProt

by josefdc

UniProt — access complete UniProtKB protein entries, sequences, filtered searches and ID mapping across 200+ databases f

Integrates with UniProtKB protein database to provide complete protein entries, sequences, filtered searches, and ID mapping between 200+ database types for bioinformatics workflows and protein research.

github stars

2

No API key needed200+ database ID mappingProduction-ready with retry logic

best for

  • / Bioinformatics researchers analyzing proteins
  • / AI/ML workflows processing biological data
  • / Converting protein IDs between databases
  • / Automating protein sequence analysis

capabilities

  • / Fetch complete protein entries with sequences and annotations
  • / Search UniProtKB database with filters
  • / Map identifiers between 200+ database types
  • / Retrieve protein sequences and metadata
  • / Export UniProt flatfiles in txt or fasta format

what it does

Provides access to UniProtKB protein database for querying protein entries, sequences, annotations, and converting between 200+ database identifier types.

about

UniProt is a community-built MCP server published by josefdc that provides AI assistants with tools and capabilities via the Model Context Protocol. UniProt — access complete UniProtKB protein entries, sequences, filtered searches and ID mapping across 200+ databases f It is categorized under developer tools, ai ml. This server exposes 5 tools that AI clients can invoke during conversations and coding sessions.

how to install

You can install UniProt in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.

license

MIT

UniProt is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.

readme

UniProt MCP Server

<!-- mcp-name: io.github.josefdc/uniprot-mcp -->

PyPI version Python versions License: MIT MCP Registry

A Model Context Protocol (MCP) server that provides seamless access to UniProtKB protein data. Query protein entries, sequences, Gene Ontology annotations, and perform ID mappings through a typed, resilient interface designed for LLM agents.

<a href="https://glama.ai/mcp/servers/@josefdc/Uniprot-MCP"> <img width="380" height="200" src="https://glama.ai/mcp/servers/@josefdc/Uniprot-MCP/badge" alt="UniProt Server MCP server" /> </a>

✨ Features

  • 🔌 Dual Transport: Stdio for local development and Streamable HTTP for remote deployments
  • 📊 Rich Data Access: Fetch complete protein entries with sequences, features, GO annotations, cross-references, and taxonomy
  • 🔍 Advanced Search: Full-text search with filtering by review status, organism, keywords, and more
  • 🔄 ID Mapping: Convert between 200+ database identifier types with progress tracking
  • 🛡️ Production Ready: Automatic retries with exponential backoff, CORS support, Prometheus metrics
  • 📝 Typed Responses: Structured Pydantic models ensure data consistency
  • 🎯 MCP Primitives: Resources, tools, and prompts designed for agent workflows

🚀 Quick Start

Installation

pip install uniprot-mcp

Run the Server

Local development (stdio):

uniprot-mcp

Remote deployment (HTTP):

uniprot-mcp-http --host 0.0.0.0 --port 8000

The HTTP server provides:

  • MCP endpoint: http://localhost:8000/mcp
  • Health check: http://localhost:8000/healthz
  • Metrics: http://localhost:8000/metrics (Prometheus format)

Test with MCP Inspector

npx @modelcontextprotocol/inspector uniprot-mcp

📚 MCP Primitives

Resources

Access static or dynamic data through URI patterns:

URIDescription
uniprot://uniprotkb/{accession}Raw UniProtKB entry JSON for any accession
uniprot://help/searchDocumentation for search query syntax

Tools

Execute actions and retrieve typed data:

ToolParametersReturnsDescription
fetch_entryaccession, fields?EntryFetch complete protein entry with all annotations
get_sequenceaccessionSequenceGet protein sequence with length and metadata
search_uniprotquery, size, reviewed_only, fields?, sort?, include_isoformSearchHit[]Full-text search with advanced filtering
map_idsfrom_db, to_db, idsMappingResultConvert identifiers between 200+ databases
fetch_entry_flatfileaccession, version, formatstringRetrieve historical entry versions (txt/fasta)

Progress tracking: map_ids reports progress (0.0 → 1.0) for long-running jobs.

Prompts

Pre-built templates for common workflows:

  • Summarize Protein: Generate a structured summary from a UniProt accession, including organism, function, GO terms, and notable features.

🔧 Configuration

Environment Variables

VariableDefaultDescription
UNIPROT_ENABLE_FIELDSunsetRequest minimal field subsets to reduce payload size
UNIPROT_LOG_LEVELinfoLogging level: debug, info, warning, error
UNIPROT_LOG_FORMATplainLog format: plain or json
UNIPROT_MAX_CONCURRENCY8Max concurrent UniProt API requests
MCP_HTTP_HOST0.0.0.0HTTP server bind address
MCP_HTTP_PORT8000HTTP server port
MCP_HTTP_LOG_LEVELinfoUvicorn log level
MCP_HTTP_RELOAD0Enable auto-reload: 1 or true
MCP_CORS_ALLOW_ORIGINS*CORS allowed origins (comma-separated)
MCP_CORS_ALLOW_METHODSGET,POST,DELETECORS allowed methods
MCP_CORS_ALLOW_HEADERS*CORS allowed headers

CLI Flags

# HTTP server flags
uniprot-mcp-http --host 127.0.0.1 --port 9000 --log-level debug --reload

📖 Usage Examples

Fetching a Protein Entry

# Using MCP client
result = await session.call_tool("fetch_entry", {
    "accession": "P12345"
})

# Returns structured Entry with:
# - primaryAccession, protein names, organism
# - sequence (length, mass, sequence string)
# - features (domains, modifications, variants)
# - GO annotations (biological process, molecular function, cellular component)
# - cross-references to other databases

Searching for Proteins

# Search reviewed human proteins
result = await session.call_tool("search_uniprot", {
    "query": "kinase AND organism_id:9606",
    "size": 50,
    "reviewed_only": True,
    "sort": "annotation_score"
})

# Returns list of SearchHit objects with accessions and scores

Mapping Identifiers

# Convert UniProt IDs to PDB structures
result = await session.call_tool("map_ids", {
    "from_db": "UniProtKB_AC-ID",
    "to_db": "PDB",
    "ids": ["P12345", "Q9Y6K9"]
})

# Returns MappingResult with successful and failed mappings

🛠️ Development

Prerequisites

  • Python 3.11 or 3.12
  • uv (recommended) or pip

Setup

# Clone the repository
git clone https://github.com/josefdc/Uniprot-MCP.git
cd Uniprot-MCP

# Install dependencies
uv sync --group dev

# Install development tools
uv tool install ruff
uv tool install mypy

Running Tests

# Run all tests with coverage
uv run pytest --maxfail=1 --cov=uniprot_mcp --cov-report=term-missing

# Run specific test file
uv run pytest tests/unit/test_parsers.py -v

# Run integration tests only
uv run pytest tests/integration/ -v

Code Quality

# Lint
uv tool run ruff check .

# Format
uv tool run ruff format .

# Type check
uv tool run mypy src

# Run all checks
uv tool run ruff check . && \
uv tool run ruff format --check . && \
uv tool run mypy src && \
uv run pytest

Local Development Server

# Stdio server
uv run uniprot-mcp

# HTTP server with auto-reload
uv run python -m uvicorn uniprot_mcp.http_app:app --reload --host 127.0.0.1 --port 8000

🏗️ Architecture

src/uniprot_mcp/
├── adapters/           # UniProt REST API client and response parsers
│   ├── uniprot_client.py  # HTTP client with retry logic
│   └── parsers.py         # Transform UniProt JSON → Pydantic models
├── models/
│   └── domain.py       # Typed data models (Entry, Sequence, etc.)
├── server.py           # MCP stdio server (FastMCP)
├── http_app.py         # MCP HTTP server (Starlette + CORS)
├── prompts.py          # MCP prompt templates
└── obs.py              # Observability (logging, metrics)

tests/
├── unit/               # Unit tests for parsers, models, tools
├── integration/        # End-to-end tests with VCR fixtures
└── fixtures/           # Test data (UniProt JSON responses)

📦 Publishing

This server is published to:

Building and Publishing

# Build distribution packages
uv build

# Publish to PyPI (requires token)
uv publish --token pypi-YOUR_TOKEN

# Publish to MCP Registry (requires GitHub auth)
mcp-publisher login github
mcp-publisher publish

See docs/registry.md for detailed registry publishing instructions.

🤝 Contributing

Contributions are welcome! Please:

  1. Read our Contributing Guidelines
  2. Follow our Code of Conduct
  3. Check the Security Policy for vulnerability reporting
  4. Review the Changelog for recent changes

Quick start for contributors:

  1. Fork the repository
  2. Create a feature branch (git checkout -b feature/amazing-feature)
  3. Make your changes with tests
  4. Run quality checks: uv tool run ruff check . && uv tool run mypy src && uv run pytest
  5. Commit using Conventional Commits (feat:, fix:, docs:, etc.)
  6. Push and open a Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

  • UniProt Consortium: For providing comprehensive, high-quality protein data through their REST API
  • Anthropic: For the Model Context Protocol specification and Python SDK
  • Community: For feedback, bug reports, and contributions

🔗 Links

⚠️ Disclaimer

This is an independent project and is not officially affiliated with or endorsed by the UniProt Consortium. Please review UniProt's terms of use when using their data.


Built with ❤️ for the bioinformatics and AI communities

FAQ

What is the UniProt MCP server?
UniProt is a Model Context Protocol (MCP) server profile on explainx.ai. MCP lets AI hosts (e.g. Claude Desktop, Cursor) call tools and resources through a standard interface; this page summarizes categories, install hints, and community ratings.
How do MCP servers relate to agent skills?
Skills are reusable instruction packages (often SKILL.md); MCP servers expose live capabilities. Teams frequently combine both—skills for workflows, MCP for APIs and data. See explainx.ai/skills and explainx.ai/mcp-servers for parallel directories.
How are reviews shown for UniProt?
This profile displays 10 aggregated ratings (sample rows for discoverability plus signed-in user reviews). Average score is about 4.5 out of 5—verify behavior in your own environment before production use.
MCP server reviews

Ratings

4.510 reviews
  • Shikha Mishra· Oct 10, 2024

    UniProt is among the better-indexed MCP projects we tried; the explainx.ai summary tracks the official description.

  • Piyush G· Sep 9, 2024

    We evaluated UniProt against two servers with overlapping tools; this profile had the clearer scope statement.

  • Chaitanya Patil· Aug 8, 2024

    Useful MCP listing: UniProt is the kind of server we cite when onboarding engineers to host + tool permissions.

  • Sakshi Patil· Jul 7, 2024

    UniProt reduced integration guesswork — categories and install configs on the listing matched the upstream repo.

  • Ganesh Mohane· Jun 6, 2024

    I recommend UniProt for teams standardizing on MCP; the explainx.ai page compares cleanly with sibling servers.

  • Oshnikdeep· May 5, 2024

    Strong directory entry: UniProt surfaces stars and publisher context so we could sanity-check maintenance before adopting.

  • Dhruvi Jain· Apr 4, 2024

    UniProt has been reliable for tool-calling workflows; the MCP profile page is a good permalink for internal docs.

  • Rahul Santra· Mar 3, 2024

    According to our notes, UniProt benefits from clear Model Context Protocol framing — fewer ambiguous “AI plugin” claims.

  • Pratham Ware· Feb 2, 2024

    We wired UniProt into a staging workspace; the listing’s GitHub and npm pointers saved time versus hunting across READMEs.

  • Yash Thakker· Jan 1, 2024

    UniProt is a well-scoped MCP server in the explainx.ai directory — install snippets and categories matched our Claude Code setup.