Building a CI/CD Pipeline for FastAPI Application with GitHub Actions and AWS EC2

Building an automated CI/CD pipeline that deploys a FastAPI app to AWS EC2 with GitHub Actions and Docker.

Posted Nov 25, 2025 Updated Jul 3, 2026

CI/CD Architecture Diagram

By Seoultech

9 min read

Building a CI/CD Pipeline for FastAPI Application with GitHub Actions and AWS EC2

Introduction

This post documents my experience building an automated CI/CD pipeline for a FastAPI application. The goal was simple: push code to GitHub, and it automatically deploys to AWS EC2. What seemed straightforward turned into a multi-hour debugging session that taught me valuable lessons about Docker, async/sync database drivers, and SSH authentication.

Architecture Overview

The pipeline architecture follows a standard CI/CD flow:

Tech Stack:

Application: FastAPI + MySQL (Docker Compose)
CI/CD: GitHub Actions
Container Registry: Docker Hub
Deployment Target: AWS EC2 (Ubuntu)

Workflow:

Developer pushes code to main branch
GitHub Actions triggers automatically
Docker image is built and pushed to Docker Hub
GitHub Actions SSHs into EC2
EC2 pulls latest image and restarts containers

Implementation

Docker Compose Configuration

The application uses two containers orchestrated by Docker Compose:

  
services:
  db:
    image: mysql:8.0
    container_name: app_db
    environment:
      MYSQL_ROOT_PASSWORD: ${DB_ROOT_PASSWORD}
      MYSQL_DATABASE: appdb
      MYSQL_USER: appuser
      MYSQL_PASSWORD: ${DB_PASSWORD}
    volumes:
      - db_data:/var/lib/mysql
    healthcheck:
      test: ["CMD", "mysqladmin", "ping", "-h", "localhost"]
      timeout: 20s
      retries: 10

  app:
    image: ${DOCKER_HUB_USERNAME}/myapp:latest
    container_name: app
    env_file:
      - .env.prod
    ports:
      - "8000:8080"
    command: sh -c "uv run alembic upgrade head && uv run uvicorn app.main:app --host 0.0.0.0 --port 8080"
    restart: always
    depends_on:
      db:
        condition: service_healthy

Key Design Choice: The depends_on with service_healthy ensures the database is fully ready before the application starts. This prevents connection errors during startup.

Note: Modern Docker invokes Compose through the docker compose (v2 plugin) command rather than the legacy standalone docker-compose binary, and the top-level version key is now obsolete (Compose v2 ignores it and emits a warning).

GitHub Actions Workflow

  
name: CI/CD Pipeline

on:
  push:
    branches: [main]

jobs:
  build:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      
      - name: Login to Docker Hub
        uses: docker/login-action@v3
        with:
          username: $
          password: $
      
      - name: Build and Push
        run: |
          docker build -t $/myapp:latest .
          docker push $/myapp:latest

  deploy:
    needs: build
    runs-on: ubuntu-latest
    steps:
      - name: Deploy to EC2
        uses: appleboy/ssh-action@v1.0.3
        with:
          host: $
          username: $
          key: $
          script: |
            cd ~/app
            sudo docker pull $/myapp:latest
            sudo DOCKER_HUB_USERNAME=$ docker compose down
            sudo DOCKER_HUB_USERNAME=$ docker compose up -d
            sudo docker image prune -af

Note: This deploy step is a learning / first-automation example — it deploys latest, runs docker compose down, and prunes images with docker image prune -af. Before relying on it in production, harden it with immutable SHA tags, by avoiding deletion of the image you would roll back to, and by recreating only the app service. See the Operational Notes section below.

Problem 1: Missing Database Container

Symptom

sqlalchemy.exc.OperationalError: (MySQLdb.OperationalError) 
(2003, "Can't connect to MySQL server on 'db'")

Root Cause

My initial docker-compose.yaml only defined the application service. I assumed the database would be handled separately, but the application expected a container named db on the same Docker network.

Solution

Added MySQL service to docker-compose.yaml with:

Proper health check to ensure database readiness
Named volumes for data persistence
depends_on with health check condition

Lesson: Always define all service dependencies in your compose file, even if you plan to use external databases later.

Problem 2: Alembic Async Driver Issue

Symptom

sqlalchemy.exc.MissingGreenlet: greenlet_spawn has not been called; 
can't call await_only() here

Root Cause

The application correctly uses aiomysql (async driver) for runtime database operations. My Alembic env.py, however, was the default synchronous template, which calls connectable.connect() directly and therefore cannot drive an async (+aiomysql) URL. (Alembic can run migrations asynchronously — see its asyncio cookbook recipe — but that requires an async-aware env.py, which I did not have here.)

Analysis

Looking at the error stack trace:

  
File "/app/wapang/database/alembic/env.py", line 72, in run_migrations_online
    with connectable.connect() as connection:

Alembic’s env.py was trying to create a synchronous connection using an async driver URL.

Solution

Modified alembic/env.py to force synchronous driver during migrations:

  
from wapang.database.settings import DB_SETTINGS

# Force sync driver for Alembic
sync_url = DB_SETTINGS.url.replace("+aiomysql", "+pymysql")
config.set_main_option("sqlalchemy.url", sync_url)

def run_migrations_online() -> None:
    connectable = create_engine(sync_url, poolclass=pool.NullPool)
    
    with connectable.connect() as connection:
        context.configure(
            connection=connection, 
            target_metadata=target_metadata
        )
        with context.begin_transaction():
            context.run_migrations()

Lesson: Async and sync contexts require different database drivers. With this sync env.py setup, a sync URL (+pymysql) was needed even though the app runs on async (+aiomysql). If you instead adopt Alembic’s async env.py template, you can keep the async driver for migrations too.

Problem 3: Missing Authentication Secrets

Symptom

pydantic_core._pydantic_core.ValidationError: 2 validation errors for AuthSettings
ACCESS_TOKEN_SECRET: Field required
REFRESH_TOKEN_SECRET: Field required

Root Cause

The .env.prod file on EC2 was missing authentication-related environment variables that the application validates on startup using Pydantic.

Solution

Added to .env.prod:

  
ACCESS_TOKEN_SECRET=your-access-token-secret-here
REFRESH_TOKEN_SECRET=your-refresh-token-secret-here

Created env.prod.example in the repository to document all required variables:

  
# Database
DB_DIALECT=mysql
DB_DRIVER=aiomysql
DB_HOST=db
DB_PORT=3306
DB_USER=appuser
DB_PASSWORD=secure_password
DB_DATABASE=appdb

# Application
DEBUG=False
SECRET_KEY=your-secret-key

# Authentication
ACCESS_TOKEN_SECRET=your-access-token-secret
REFRESH_TOKEN_SECRET=your-refresh-token-secret

Lesson: Always maintain an example environment file in version control to document required configuration variables.

Problem 4: SSH Key Authentication Failure

Symptom

ssh: handshake failed: ssh: unable to authenticate, 
attempted methods [none], no supported methods remain

Root Cause

The EC2_SSH_KEY GitHub Secret was improperly formatted. SSH private keys are extremely sensitive to formatting - any missing newline or extra space breaks authentication.

Solution

Ensured the private key in GitHub Secrets includes:

Complete -----BEGIN RSA PRIVATE KEY----- header
All key content with proper line breaks
Complete -----END RSA PRIVATE KEY----- footer
No extra whitespace before or after

Example of correct format:

-----BEGIN RSA PRIVATE KEY-----
MIIEpAIBAAKCAQEAx...
(multiple lines)
...ending_characters
-----END RSA PRIVATE KEY-----

Lesson: When copying SSH keys to secrets managers, preserve exact formatting including all newlines.

Problem 5: Docker Permission Denied in CI/CD

Symptom

permission denied while trying to connect to the Docker daemon socket 
at unix:///var/run/docker.sock

Root Cause

Even though the ubuntu user was added to the docker group during EC2 setup, SSH remote commands don’t inherit group memberships in the same way as interactive sessions.

Solution

Prefix all Docker commands with sudo in the deployment script:

  
script: |
  cd ~/app
  sudo docker pull $/myapp:latest
  sudo DOCKER_HUB_USERNAME=$ docker compose pull
  sudo DOCKER_HUB_USERNAME=$ docker compose down
  sudo DOCKER_HUB_USERNAME=$ docker compose up -d

Important: When using sudo with environment variables, you must explicitly pass them through sudo VARIABLE=value command.

Lesson: Remote SSH command execution behaves differently from interactive shells. Explicit sudo usage is often necessary in CI/CD contexts.

Results

After resolving all issues, the pipeline works flawlessly:

The application is now accessible at the deployed endpoint, with Swagger UI showing all available API routes.

Deployment Metrics:

Build time: ~2-3 minutes
Deployment time: ~30 seconds
Total automation: Push to production in under 4 minutes
Simple, automated short-downtime deployment via Docker Compose

Note: This is not zero-downtime: the deploy script runs docker compose down then up -d, so the container is stopped and recreated, leaving a brief gap where the app is unavailable. True zero-downtime would require a blue-green or rolling-update strategy with a reverse proxy switching upstreams only after the new container passes health checks.

Security Considerations

Secrets Management: All sensitive data stored in GitHub Secrets, never in repository
Environment Separation: Production variables in .env.prod on server only
SSH Key Security: Private key with proper permissions (600), stored only in GitHub Secrets
EC2 Security Groups: Limited to necessary ports (22 for SSH, 8000 for application)

Key Takeaways

Database Health Checks: Always wait for dependencies to be ready before starting dependent services
Async vs Sync Drivers: A sync Alembic env.py needs a sync driver even when the app uses async; Alembic also supports an async env.py if you prefer to keep the async driver
Environment Documentation: Maintain example config files to document all required variables
SSH Key Formatting: Private keys must preserve exact formatting including all line breaks
CI/CD Permissions: Use explicit sudo in automated deployment scripts; don’t rely on group memberships

Operational Notes

A few caveats worth hardening before relying on this pipeline in production:

Tag images immutably, not just latest: Pushing only latest makes rollback painful, since the tag always points at the newest build. Also push an immutable tag — for example the commit SHA, myapp:${GITHUB_SHA} — so you can redeploy a specific known-good image when something breaks.
Watch out for docker image prune -af: This aggressively deletes all unused images, including the previous one you would roll back to. Be careful with it in the deploy script, or keep the last N tags around instead of pruning everything.
docker compose down stops the DB too: It brings down every service, including the database container, not just the app. To recreate only the application you can target it explicitly, e.g. docker compose up -d --pull always app.

Putting those caveats together, a more production-leaning variant of the build and deploy steps looks like this:

  
# --- production-leaning variant ---
# CI: build & push BOTH an immutable SHA tag and latest
- name: Build and Push
  run: |
    docker build -t $IMAGE:$ -t $IMAGE:latest .
    docker push $IMAGE:$
    docker push $IMAGE:latest

# Deploy on EC2: pull the exact SHA image and recreate ONLY the app service
# (assumes the compose app service uses `image: ${APP_IMAGE}`)
- name: Deploy to EC2
  # ...ssh-action with host/username/key...
  script: |
    cd ~/app
    sudo docker pull $IMAGE:$
    sudo APP_IMAGE=$IMAGE:$ docker compose up -d --no-deps app
    sudo docker image prune -f   # dangling only, so rollback images survive

Pinning the SHA, recreating only app with --no-deps, and pruning just dangling layers keeps the database untouched and leaves the previous image available to roll back to.

Conclusion

Building this CI/CD pipeline took significantly longer than expected due to the various issues encountered. However, each problem taught valuable lessons about Docker containerization, database driver compatibility, and deployment automation.

The end result is a robust pipeline that saves hours of manual deployment work and reduces the risk of human error. What once required multiple manual steps now happens automatically on every code push.

Resources

DevOps, CI/CD

This post is licensed under CC BY 4.0 by the author.

Introduction

Architecture Overview

Implementation

Docker Compose Configuration

GitHub Actions Workflow

Problem 1: Missing Database Container

Symptom

Root Cause

Solution

Problem 2: Alembic Async Driver Issue

Symptom

Root Cause

Analysis

Solution

Problem 3: Missing Authentication Secrets

Symptom

Root Cause

Solution

Problem 4: SSH Key Authentication Failure

Symptom

Root Cause

Solution

Problem 5: Docker Permission Denied in CI/CD

Symptom

Root Cause

Solution

Results

Security Considerations

Key Takeaways

Operational Notes

Conclusion

Resources

Trending Tags