Welcome to Orchestr8
Orchestr8 is an enterprise-grade AI platform built on Kubernetes, designed to make deploying and managing AI workloads simple, secure, and scalable.
What is Orchestr8?â
Orchestr8 transforms Kubernetes into a comprehensive AI platform by combining:
đ¤ AI Runtime Layerâ
- Llama-Stack: Enterprise-ready AI framework for LLM/RAG/Agent workloads
- Multi-Provider Support: OpenAI, Anthropic, Groq, AWS Bedrock, and local models
- GPU Orchestration: Intelligent GPU scheduling and resource management
- Vector Databases: ChromaDB, Qdrant, PGVector for RAG applications
đ Enterprise Security & Governanceâ
- ArgoCD: GitOps continuous delivery engine
- Istio: Service mesh for traffic management and mTLS
- Keycloak: Identity and Access Management (IAM)
- OAuth2 Proxy: Service authentication layer
- Network Policies: Zero-trust security by default
đ Operational Excellenceâ
- Prometheus/Grafana: AI-specific monitoring and observability
- Cert-Manager: Automatic TLS certificate management
- CloudNative PostgreSQL: High-performance database operator
- Model Management: Versioning, caching, and lifecycle management
Getting Startedâ
Prerequisitesâ
- uv package manager (automatically bootstraps Python)
- Kubernetes cluster with GPU nodes (for AI workloads)
- Additional CLI tools based on your deployment target
See the complete Installation Guide for setup instructions.
Installationâ
Install the Orchestr8 CLI using uv:
uv tool install orchestr8-platform
Verify the installation:
o8 --version
Choose Your Platformâ
đŠī¸ Cloud Deploymentâ
Perfect for production workloads with enterprise features:
- AWS - Deploy on Amazon EKS
- Azure - Deploy on Azure AKS
- Google Cloud - Deploy on Google GKE
đģ Local Developmentâ
Ideal for development, testing, and learning:
- Local Setup - kind, minikube, or Docker Desktop
One-Command Setupâ
Once you've chosen your platform:
# For cloud deployment (replace 'aws' with your provider)
o8 setup --provider aws --domain your-domain.com
# For local development
o8 setup --provider local
This command will:
- â Set up the GitOps platform with ArgoCD
- â Configure Istio service mesh
- â Install Keycloak for identity management
- â Deploy Llama-Stack AI runtime
- â Set up AI-specific monitoring with Prometheus/Grafana
- â Configure TLS certificate management
- â Initialize GPU resource management
Verify Your Installationâ
Check that everything is running:
o8 doctor
This command validates:
- Cluster connectivity and GPU nodes
- Platform components health
- AI runtime configuration
- Model provider access
- Configuration correctness
What's Next?â
- Deploy Your First AI Workload - Create RAG or agent applications
- Explore Core Concepts - Understand GitOps, modules, and architecture
- Deploy Your First Module - Try example applications
- Configure Security - Harden your deployment
- Complete Installation Guide - Detailed setup instructions
AI Workloadsâ
Orchestr8 makes it easy to deploy and manage AI applications:
đ¤ Create Your First AI Workloadâ
# Initialize a RAG application
o8 llama init my-rag-app --template rag --provider openai
# Or create an agentic workflow
o8 llama init my-agent --template agent --provider anthropic
# Deploy to your cluster
cd my-rag-app
o8 llama deploy --environment dev
đ Monitor AI Applicationsâ
# Check AI workload status
o8 llama status
# View real-time logs
o8 llama logs --follow
# List configured AI providers
o8 llama providers
đ§ Available AI Templatesâ
- RAG Applications: Retrieval-Augmented Generation with vector search
- Agentic Workflows: Multi-step reasoning with tool integration
- Inference Services: High-performance model serving
- Custom Workloads: Build your own AI applications
Key Featuresâ
đ¤ AI-Native Platformâ
- Multi-Provider AI: OpenAI, Anthropic, Groq, AWS Bedrock, and local models
- GPU Orchestration: Intelligent scheduling and resource sharing
- Vector Databases: ChromaDB, Qdrant, PGVector for RAG applications
- Model Management: Caching, versioning, and lifecycle automation
đ Security & Complianceâ
- Zero-Trust Networking: Default-deny network policies with explicit allow rules
- Pod Security Standards: Restricted security profiles enforced
- Identity Management: Keycloak OIDC with OAuth2 Proxy integration
- Compliance Ready: SOC2, GDPR, and HIPAA controls built-in
⥠Enterprise Operationsâ
- GitOps-First: All configuration stored in Git with ArgoCD managing deployments
- Multi-Environment: Support for local, dev, staging, production environments
- Module System: Deployable components with namespace isolation
- AI Monitoring: Specialized metrics for model performance and costs
Need Help?â
- đ Documentation: Browse our comprehensive guides
- đ Issues: Report bugs
- đŦ Community: Join our Discord
- đ§ Support: Contact support@agenticinsights.com