Batch Processing

Batch processing in AWS SageMaker is a method of running inference jobs on large datasets in a non-interactive manner.

How It Works

In batch processing, you submit your entire dataset as a single job and wait for SageMaker to process it. The results are then stored in an S3 bucket for later retrieval.

Benefits

Limitations

Features

Use Cases

Real-Time Inference

Real-time inference in AWS SageMaker is a method of running inference jobs that require immediate responses.

How It Works

In real-time inference, you send a single observation to your model and get a prediction back in real-time. The model is hosted on an endpoint that stays open as long as you need it.