Serverless containers on AWS

Deploy, manage, and scale containers without managing infrastructure.

Scale realtime, batch, and async workloads

Realtime

Respond to requests in real time and autoscale based on the number of in-flight requests.
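
As a rough illustration of autoscaling on in-flight requests, the sketch below derives a replica count from the current number of in-flight requests. The function name, the `target_in_flight` parameter, and the replica bounds are assumptions made for this example; they are not taken from Cortex's configuration or its actual algorithm.

```python
import math


def desired_replicas(in_flight: int, target_in_flight: float,
                     min_replicas: int = 1, max_replicas: int = 100) -> int:
    """Hypothetical helper: replicas needed so that each one handles
    roughly `target_in_flight` concurrent requests."""
    if target_in_flight <= 0:
        raise ValueError("target_in_flight must be positive")
    # Round up so that no replica is asked to exceed its target.
    wanted = math.ceil(in_flight / target_in_flight)
    # Clamp to the configured bounds.
    return max(min_replicas, min(max_replicas, wanted))
```

With a target of 4 in-flight requests per replica, 25 in-flight requests would call for 7 replicas, and an idle workload would stay at the configured minimum.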

Batch

Run distributed, fault-tolerant batch processing jobs on demand.

Async

Process requests asynchronously and autoscale based on request queue length.
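
The idea of sizing workers from queue length can be sketched with an in-memory queue. This is only an illustration under stated assumptions: `process`, `worker`, and `per_worker_backlog` are hypothetical names, the "work" is a placeholder, and a real deployment would read from a managed queue rather than `asyncio.Queue`.

```python
import asyncio


async def worker(queue: asyncio.Queue, results: list) -> None:
    """Hypothetical worker: pulls requests off the queue until cancelled."""
    while True:
        item = await queue.get()
        results.append(item * 2)  # stand-in for real request handling
        queue.task_done()


async def process(requests: list, per_worker_backlog: int = 5):
    """Enqueue requests, size the worker pool from queue length, and drain."""
    queue: asyncio.Queue = asyncio.Queue()
    for request in requests:
        queue.put_nowait(request)

    # Ceiling division: one worker per `per_worker_backlog` queued requests.
    # An empty queue yields zero workers.
    n_workers = -(-queue.qsize() // per_worker_backlog)

    results: list = []
    tasks = [asyncio.create_task(worker(queue, results))
             for _ in range(n_workers)]

    await queue.join()  # wait until every queued request has been handled
    for task in tasks:
        task.cancel()
    await asyncio.gather(*tasks, return_exceptions=True)
    return n_workers, sorted(results)
```

Calling `asyncio.run(process(list(range(12))))` would start 3 workers for the 12 queued requests; an empty request list starts none.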

Allocate CPU, GPU, and memory without limits

No resource limits

Allocate as much CPU, GPU, and memory as each workload requires.

No cold starts

Keep a minimum number of replicas running to ensure that requests are handled in real time.

No timeouts

Run workloads for as long as you want.

Control your AWS spend

Scale to zero

Optimize the autoscaling behavior of each workload to minimize idle resources.

Multi-instance

Run different workloads on different EC2 instances to ensure efficient resource utilization.

Spot instances

Run workloads on spot instances and fall back to on-demand instances to ensure reliability.
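
The spot-then-on-demand fallback can be pictured as trying purchase options in order of preference. The sketch below is a minimal illustration, not how Cortex provisions instances: `launch_with_fallback`, `CapacityError`, and the `launch` callback are all hypothetical, and no AWS API is called.

```python
class CapacityError(Exception):
    """Hypothetical error raised when a purchase option has no capacity."""


def launch_with_fallback(launch, order=("spot", "on-demand")):
    """Try each purchase option in order and return the first that succeeds.

    `launch` is a caller-supplied function that takes the purchase option
    and either returns an instance handle or raises CapacityError.
    """
    for kind in order:
        try:
            return kind, launch(kind)
        except CapacityError:
            continue  # fall through to the next purchase option
    raise CapacityError("no capacity available for any purchase option")
```

Passing a `launch` function that raises `CapacityError` for `"spot"` makes the helper return an on-demand instance instead, which mirrors the reliability fallback described above.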

Create a cluster on your AWS account

EKS

Cortex runs on top of EKS to scale workloads reliably and cost-effectively.

SQS

Cortex automatically manages SQS queues for batch and async APIs.

EC2

Cortex automatically provisions EC2 instances to handle any workload.

VPC

Cortex deploys into a VPC on your account so your data stays private.

IAM

Cortex integrates with IAM to simplify authentication and authorization.

ECR

Cortex connects to public or private Docker container registries.

Discovering Cortex has been a lifesaver, it is servicing half a billion API calls each month for us. The ease with which we’ve been able to deploy Cortex has facilitated rapid development across our team, enabling us to meet the needs of our highly demanding customers.
Madison Bahmer - Two Six Technologies