WarpHelix Technical Architecture & Security
From Stanford's Biomni to enterprise-grade product
← Back to WarpHelix
Foundation: Stanford Biomni
Biomni is Stanford's first general-purpose biomedical AI agent framework, featuring two core innovations:
Biomni-E1: Unified Biomedical Environment
Through Action Discovery, Biomni systematically mines tools and databases from thousands of papers, creating the first unified agent execution environment for biomedicine.
10,000+ biomedical papers
↓ Action Discovery
150+ tools + 59 databases + 105 software packages
↓ Unified interface
Biomni-E1 Unified Environment
Biomni-A1: Generalist Agent Architecture
- LLM reasoning — Understands questions, plans analysis strategies
- Retrieval-augmented planning — Finds optimal tool combinations
- Code execution — Auto-generates and runs analysis code
- Iterative refinement — Checks results, auto-adjusts approach
WarpHelix Architecture Overview
┌─────────────────────────────────────────┐
│ User Layer │
│ Web UI ←→ WebSocket Real-time │
├─────────────────────────────────────────┤
│ Application Services │
│ Auth │ Metering │ Sessions │ Files │
├─────────────────────────────────────────┤
│ AI Agent Layer │
│ LLM Reasoning ↔ Tool Discovery ↔ Code │
│ Retrieval-Augmented Planning │
├─────────────────────────────────────────┤
│ Biomni-E1 Environment │
│ 150+ Tools │ 59 DBs │ 105 Packages │
├─────────────────────────────────────────┤
│ Infrastructure │
│ LLM API │ Database │ Storage │ Sandbox │
└─────────────────────────────────────────┘
AI Agent Core Workflow
User Question
├── 🧠 Think — Understand intent, plan analysis
├── 🔍 Search — Retrieve relevant tools from 150+
├── ⚡ Execute — Generate and run code in sandbox
├── 👁️ Observe — Validate results, iterate if needed
└── 📋 Solution — Synthesize report + visualizations
Retrieval-Augmented Planning (RAP)
- Tool Retrieval — Vector search for relevant tool documentation
- Context Injection — Feed tool docs, usage examples into LLM context
- Plan Generation — LLM produces precise analysis plan with full context
- Execution Validation — Verify each step, adjust as needed
Benefit: Even if the LLM hasn't "memorized" a specific bioinformatics tool, it can retrieve accurate documentation — drastically reducing hallucinations.
Security Sandbox
| Layer | Measure |
|---|
| Network | Agent can only access whitelisted public databases |
| Files | Per-user isolated file spaces |
| Resources | CPU, memory, disk limits per execution |
| Timeout | Execution time limits prevent infinite loops |
| Audit | All code execution fully logged |
🔒 Data Security & Compliance
Data Flow
User Input → HTTPS → WarpHelix Server
├── Conversations → Encrypted DB (user-isolated)
├── Files → Encrypted storage (user-isolated)
├── Code → Secure sandbox execution
└── LLM calls → AWS Bedrock / DashScope
(no storage, no training)
Security Commitments
| Commitment | Detail |
|---|
| 🔐 Encryption | TLS 1.3 in transit + AES-256 at rest |
| 🚫 No Training | User data never used for model training |
| 🏠 Private Deploy | Full stack deployable on your infrastructure |
| 👤 Data Isolation | Strict per-user isolation |
| 📋 Audit Trail | Complete operation logging |
| 🗑️ Data Deletion | Users can delete all their data |
Compliance
- GDPR — Data access and deletion rights supported
- China Classified Protection — Private deployment meets requirements
- Institutional audits — Comprehensive audit logs
Deployment
Docker Compose one-click deployment:
| Config | CPU | RAM | Storage | Scale |
|---|
| Minimum | 4 cores | 16 GB | 100 GB SSD | 10 users |
| Recommended | 8 cores | 32 GB | 500 GB SSD | 50 users |
| High | 16 cores | 64 GB | 1 TB SSD | 200+ users |
← Back to WarpHelix