Inference#
Complete reference for SageMaker HyperPod inference parameters and configuration options.
Create Inference Endpoint – Init Experience#
hyp init#
Initialize a template scaffold in the current directory.
Syntax#
hyp init TEMPLATE [DIRECTORY] [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
CHOICE |
Yes |
Template type (cluster-stack, hyp-pytorch-job, hyp-custom-endpoint, hyp-jumpstart-endpoint) |
|
PATH |
No |
Target directory (default: current directory) |
|
TEXT |
No |
Schema version to use |
hyp configure#
Configure training job parameters interactively or via command line.
Important
Pre-Deployment Configuration: This command modifies local config.yaml files before job creation.
Syntax#
hyp configure [OPTIONS]
Note: This command dynamically supports all configuration parameters available in the current template’s schema.
Parameters for Jumpstart Endpoint#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
TEXT |
Yes |
JumpStart model identifier (1-63 characters, alphanumeric with hyphens) |
|
TEXT |
Yes |
EC2 instance type for inference (must start with “ml.”) |
|
BOOLEAN |
No |
Whether model terms of use have been accepted (default: false) |
|
TEXT |
No |
Semantic version of the model (e.g., “1.0.0”, 5-14 characters) |
|
TEXT |
No |
Name of SageMaker endpoint (1-63 characters, alphanumeric with hyphens) |
|
TEXT |
No |
S3 URI to write the TLS certificate (optional) |
Parameters for Custom Endpoint#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
TEXT |
Yes |
EC2 instance type for inference (must start with “ml.”) |
|
TEXT |
Yes |
Name of model to create on SageMaker (1-63 characters, alphanumeric with hyphens) |
|
TEXT |
Yes |
Model source type (“s3” or “fsx”) |
|
TEXT |
Yes |
Docker image URI for inference |
|
INTEGER |
Yes |
Port on which model server listens (1-65535) |
|
TEXT |
Yes |
Name of the model volume mount |
|
TEXT |
No |
Name of SageMaker endpoint (1-63 characters, alphanumeric with hyphens) |
|
OBJECT |
No |
Environment variables as key-value pairs |
|
BOOLEAN |
No |
Enable metrics collection (default: false) |
|
TEXT |
No |
Version of the model (semantic version format) |
|
TEXT |
No |
Specific model data location |
|
BOOLEAN |
No |
Whether to pre-fetch model data (default: false) |
|
TEXT |
No |
S3 URI for TLS certificate output |
|
TEXT |
No |
FSx File System DNS Name |
|
TEXT |
No |
FSx File System ID |
|
TEXT |
No |
FSx File System Mount Name |
|
TEXT |
No |
S3 bucket location |
|
TEXT |
No |
S3 bucket region |
|
TEXT |
No |
Path inside container for model volume (default: “/opt/ml/model”) |
|
OBJECT |
No |
Resource limits for the worker |
|
OBJECT |
No |
Resource requests for the worker |
|
OBJECT |
No |
CloudWatch Metric dimensions as key-value pairs |
|
INTEGER |
No |
Period for CloudWatch query (default: 300) |
|
INTEGER |
No |
StartTime for CloudWatch query (default: 300) |
|
TEXT |
No |
Metric name to query for CloudWatch trigger |
|
TEXT |
No |
Statistics metric for CloudWatch (default: “Average”) |
|
TEXT |
No |
Type of metric for HPA (“Value” or “Average”, default: “Average”) |
|
NUMBER |
No |
Minimum metric value for empty CloudWatch response (default: 0) |
|
TEXT |
No |
Name for the CloudWatch trigger |
|
TEXT |
No |
AWS CloudWatch namespace for the metric |
|
NUMBER |
No |
Target value for the CloudWatch metric |
|
BOOLEAN |
No |
Enable caching of metric values (default: true) |
|
TEXT |
No |
Invocation endpoint path (default: “invocations”) |
Note: The exact parameters available depend on your current template type and version. Run hyp configure --help to see all available options for your specific configuration.
hyp validate#
Validate the current directory’s configuration file syntax and structure.
Syntax#
# Validate current configuration syntax
hyp validate
# Example output on success
✔️ config.yaml is valid!
# Example output with syntax errors
❌ Config validation errors:
– job_name: Field is required
Parameters#
No parameters required.
Note
This command performs syntactic validation only of the config.yaml file against the appropriate schema. It checks:
YAML syntax: Ensures file is valid YAML
Required fields: Verifies all mandatory fields are present
Data types: Confirms field values match expected types (string, number, boolean, array)
Schema structure: Validates against the template’s defined structure
This command performs syntactic validation only and does not verify the actual validity of values (e.g., whether AWS regions exist, instance types are available, or resources can be created).
Prerequisites
Must be run in a directory where
hyp inithas created configuration filesA
config.yamlfile must exist in the current directory
Output
Success: Displays confirmation message if syntax is valid
Errors: Lists specific syntax errors with field names and descriptions
hyp reset#
Reset the current directory’s config.yaml to default values.
Syntax#
hyp reset
Parameters#
No parameters required.
hyp create#
Create a new HyperPod endpoint using the provided configuration.
Syntax#
hyp create [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
FLAG |
No |
Enable debug logging |
Create Training Job – Direct Create#
hyp create hyp-jumpstart-endpoint#
Deploy pre-trained models from SageMaker JumpStart.
Syntax#
hyp create hyp-jumpstart-endpoint [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
TEXT |
Yes |
JumpStart model identifier (1-63 characters, alphanumeric with hyphens) |
|
TEXT |
Yes |
EC2 instance type for inference (must start with “ml.”) |
|
TEXT |
No |
Kubernetes namespace |
|
TEXT |
No |
Name of the jumpstart endpoint object |
|
BOOLEAN |
No |
Whether model terms of use have been accepted (default: false) |
|
TEXT |
No |
Semantic version of the model (e.g., “1.0.0”, 5-14 characters) |
|
TEXT |
No |
Name of SageMaker endpoint (1-63 characters, alphanumeric with hyphens) |
|
TEXT |
No |
S3 URI to write the TLS certificate (optional) |
|
FLAG |
No |
Enable debug mode (default: false) |
hyp create hyp-custom-endpoint#
Deploy custom models with your own inference code.
Syntax#
hyp create hyp-custom-endpoint [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
TEXT |
Yes |
EC2 instance type for inference (must start with “ml.”) |
|
TEXT |
Yes |
Name of model to create on SageMaker (1-63 characters, alphanumeric with hyphens) |
|
TEXT |
Yes |
Model source type (“s3” or “fsx”) |
|
TEXT |
Yes |
Docker image URI for inference |
|
INTEGER |
Yes |
Port on which model server listens (1-65535) |
|
TEXT |
Yes |
Name of the model volume mount |
|
TEXT |
No |
Kubernetes namespace |
|
TEXT |
No |
Name of the custom endpoint object |
|
TEXT |
No |
Name of SageMaker endpoint (1-63 characters, alphanumeric with hyphens) |
|
OBJECT |
No |
Environment variables as key-value pairs |
|
BOOLEAN |
No |
Enable metrics collection (default: false) |
|
TEXT |
No |
Version of the model (semantic version format) |
|
TEXT |
No |
Specific model data location |
|
BOOLEAN |
No |
Whether to pre-fetch model data (default: false) |
|
TEXT |
No |
S3 URI for TLS certificate output |
|
TEXT |
No |
FSx File System DNS Name |
|
TEXT |
No |
FSx File System ID |
|
TEXT |
No |
FSx File System Mount Name |
|
TEXT |
No |
S3 bucket location |
|
TEXT |
No |
S3 bucket region |
|
TEXT |
No |
Path inside container for model volume (default: “/opt/ml/model”) |
|
OBJECT |
No |
Resource limits for the worker |
|
OBJECT |
No |
Resource requests for the worker |
|
OBJECT |
No |
CloudWatch Metric dimensions as key-value pairs |
|
INTEGER |
No |
Period for CloudWatch query (default: 300) |
|
INTEGER |
No |
StartTime for CloudWatch query (default: 300) |
|
TEXT |
No |
Metric name to query for CloudWatch trigger |
|
TEXT |
No |
Statistics metric for CloudWatch (default: “Average”) |
|
TEXT |
No |
Type of metric for HPA (“Value” or “Average”, default: “Average”) |
|
NUMBER |
No |
Minimum metric value for empty CloudWatch response (default: 0) |
|
TEXT |
No |
Name for the CloudWatch trigger |
|
TEXT |
No |
AWS CloudWatch namespace for the metric |
|
NUMBER |
No |
Target value for the CloudWatch metric |
|
BOOLEAN |
No |
Enable caching of metric values (default: true) |
|
TEXT |
No |
Invocation endpoint path (default: “invocations”) |
|
FLAG |
No |
Enable debug mode (default: false) |
Inference Endpoint Management Commands#
Commands for managing inference endpoints.
hyp list hyp-jumpstart-endpoint#
List JumpStart model endpoints.
Syntax#
hyp list hyp-jumpstart-endpoint [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
TEXT |
No |
Namespace to list endpoints from (default: “default”) |
hyp list hyp-custom-endpoint#
List custom model endpoints.
Syntax#
hyp list hyp-custom-endpoint [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
TEXT |
No |
Namespace to list endpoints from (default: “default”) |
hyp describe hyp-jumpstart-endpoint#
Describe a JumpStart model endpoint.
Syntax#
hyp describe hyp-jumpstart-endpoint [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
TEXT |
Yes |
Name of the endpoint to describe |
|
TEXT |
No |
Namespace of the endpoint (default: “default”) |
|
FLAG |
No |
Display full JSON output |
hyp describe hyp-custom-endpoint#
Describe a custom model endpoint.
Syntax#
hyp describe hyp-custom-endpoint [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
TEXT |
Yes |
Name of the endpoint to describe |
|
TEXT |
No |
Namespace of the endpoint (default: “default”) |
|
FLAG |
No |
Display full JSON output |
hyp invoke hyp-jumpstart-endpoint#
Invoke a JumpStart model endpoint.
Syntax#
hyp invoke hyp-jumpstart-endpoint [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
TEXT |
Yes |
Name of the endpoint to invoke |
|
TEXT |
Yes |
Request body (JSON format) |
|
TEXT |
No |
Content type of the request (default: “application/json”) |
hyp invoke hyp-custom-endpoint#
Invoke a custom model endpoint.
Syntax#
hyp invoke hyp-custom-endpoint [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
TEXT |
Yes |
Name of the endpoint to invoke |
|
TEXT |
Yes |
Request body (JSON format) |
|
TEXT |
No |
Content type of the request (default: “application/json”) |
hyp delete hyp-jumpstart-endpoint#
Delete a JumpStart model endpoint.
Syntax#
hyp delete hyp-jumpstart-endpoint [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
TEXT |
Yes |
Name of the endpoint to delete |
|
TEXT |
No |
Namespace of the endpoint (default: “default”) |
hyp delete hyp-custom-endpoint#
Delete a custom model endpoint.
Syntax#
hyp delete hyp-custom-endpoint [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
TEXT |
Yes |
Name of the endpoint to delete |
|
TEXT |
No |
Namespace of the endpoint (default: “default”) |
hyp list-pods hyp-jumpstart-endpoint#
List pods for JumpStart endpoints.
Syntax#
hyp list-pods hyp-jumpstart-endpoint [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
TEXT |
No |
Namespace to list pods from (default: “default”) |
hyp list-pods hyp-custom-endpoint#
List pods for custom endpoints.
Syntax#
hyp list-pods hyp-custom-endpoint [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
TEXT |
No |
Namespace to list pods from (default: “default”) |
hyp get-logs hyp-jumpstart-endpoint#
Get logs from JumpStart endpoint pods.
Syntax#
hyp get-logs hyp-jumpstart-endpoint [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
TEXT |
Yes |
Name of the pod to get logs from |
|
TEXT |
No |
Container name to get logs from |
|
TEXT |
No |
Namespace of the pod (default: “default”) |
hyp get-logs hyp-custom-endpoint#
Get logs from custom endpoint pods.
Syntax#
hyp get-logs hyp-custom-endpoint [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
TEXT |
Yes |
Name of the pod to get logs from |
|
TEXT |
No |
Container name to get logs from |
|
TEXT |
No |
Namespace of the pod (default: “default”) |
hyp get-operator-logs hyp-jumpstart-endpoint#
Get operator logs for JumpStart endpoints.
Syntax#
hyp get-operator-logs hyp-jumpstart-endpoint [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
FLOAT |
Yes |
Time frame to get logs for (in hours) |
hyp get-operator-logs hyp-custom-endpoint#
Get operator logs for custom endpoints.
Syntax#
hyp get-operator-logs hyp-custom-endpoint [OPTIONS]
Parameters#
Parameter |
Type |
Required |
Description |
|---|---|---|---|
|
FLOAT |
Yes |
Time frame to get logs for (in hours) |
Parameter Reference#
Common Parameters Across Commands#
Parameter |
Type |
Description |
Default |
|---|---|---|---|
|
TEXT |
Kubernetes namespace |
Current context |
|
FLAG |
Show command help |
- |