Extra ✦

At this stage, we will explore advanced topics in the field of backend development and DevOps that will help you deepen your knowledge and skills in creating reliable and scalable systems for AI Agents. These concepts are especially useful for those aspiring to system architect or tech lead roles in AI projects.

Ask AI Instructions

instruction

Since these topics do not change over time, it is best for you to study them with a personal tutor - ChatGPT.

The learning process should be as follows:

you create a system prompt for ChatGPT (templates), where you describe your background, preferences, level of detail of explanations, etc.
copy the topic from the list (triple click), and ask ChatGPT to explain this topic to you
if you want to delve deeper, ask clarifying questions

At the moment, this is the most convenient way to learn the basics. In addition to concepts, you can study additional materials in the Gold, Silver, Extra sections.

Gold - be sure to study before communicating with ChatGPT
Ask AI - ask questions on each unfamiliar topic
Silver - secondary materials
Extra - in-depth topics

Golden

10 Sysdes Patterns

Why Kubernetes is so popular

Sysdes

More Sysdes

Ansible

Terraform

Ask AI

DevOps and Infrastructure

Nginx for AI systems: load balancing and request proxying
Kubernetes: orchestrating ML workflows in production (practical cases)
Kubernetes Operators: automating repetitive tasks (Overview)
GitOps for beginners: basic principles and ArgoCD setup
Kubernetes monitoring: Prometheus + Grafana (templates for AI)
Service Mesh: basic concepts of Istio/Linkerd (Briefly)
Helm: application templating (workshop for AI developer)
Canary Deployments: safe model updates (step-by-step guide)
Infrastructure as Code: comparison of Terraform and Pulumi (concept)
CI/CD pipelines: automating model training (end-to-end example)

Highload systems

DB Sharding: basic strategies for beginners
CQRS + Event Sourcing: architectural patterns (Overview)
Message queues: Kafka vs RabbitMQ (comparison for AI)
Backpressure: protecting systems from overload (practical examples)
Data consistency: basic patterns of distributed systems
Latency optimization: diagnosing problems in AI inference
Caching: multi-level strategies (practical cases)
Observability: monitoring AI pipelines (OpenTelemetry)
Big Data processing: Spark for beginners (basic concepts)
Rate Limiting: API protection (ready-made solutions and libraries)

Security and reliability

OAuth 2.0: practical implementation for AI systems
Model protection: basic methods against prompt injection
Zero Trust: basic principles (Brief overview)
Secrets Management: working with HashiCorp Vault (guide)
Fault Tolerance: templates for beginners (Overview)
gRPC: optimizing communication between microservices
Blue-Green Deployments: basic scenario for AI models
SLA/SLO/SLI: quality metrics (practical examples)
Security audit: main stages (checklist)
Redundancy: strategies for AI inference (Briefly)

Cloud technologies and financial optimization

Multi-cloud strategies: reducing dependence on providers for AI systems
FinOps: optimizing costs for cloud GPUs and TPUs for AI projects
Spot Instances: effective use for model training
Serverless for AI: architectural patterns and antipatterns
Cloud Native AI: effective use of cloud ML/AI services
Data Lake and Data Warehouse: architectures for AI data
Edge Computing: moving AI inference closer to data sources
Benchmarking cloud providers: methodology for AI workflows
Pay-as-you-go vs Reserved Instances: strategies for AI startups
Cloud automation: robots for monitoring and optimizing costs

Databases and storage for AI

Vector DBs: optimizing queries and indexing for RAG systems
Time Series DB: storing and analyzing time series for AI monitoring
NewSQL: modern distributed DBs with ACID guarantees
Data Lakehouse: architecture for AI startups (Delta Lake, Iceberg)
Column Store vs Row Store: choice for analytical AI systems
Embedded DB: local solutions for Edge AI (SQLite, DuckDB)
Transactional Outbox: reliable event transfer between services
Full-text search: Elasticsearch for hybrid search with AI
Database Federation: combining heterogeneous data sources
Graph DB: using for LLM knowledge graphs and recommendations

Silver

DevOps Roadmap for AI Engineer
Modern cloud application architecture patterns
Ansible vs Puppet vs Chef: comparative analysis
Testing distributed systems: approaches and tools

Extra

Developing custom Kubernetes operators for AI workflows
EventMesh: global event bus for microservice AI systems
WebAssembly as a runtime environment for lightweight AI models
eBPF: kernel-level monitoring and debugging for high-load AI systems
unikernels: minimalistic specialized OSs for AI inference
Functional programming in backend development: benefits for AI systems
SRE for AI systems: Google practices and processes
Quantum computing for AI: current state and prospects
Zero-downtime database migrations: strategies for continuous operation
Data Sovereignty: compliance with regional requirements for AI data

Golden​

Ask AI​

DevOps and Infrastructure​

Highload systems​

Security and reliability​

Cloud technologies and financial optimization​

Databases and storage for AI​

Silver​

Extra​