aws
diff --git a/‎doc/_static/image.png
2.51 KB b/‎doc/_static/image.png
2.51 KB
diff --git a/‎doc/_static/image_dark.png
36.9 KB b/‎doc/_static/image_dark.png
36.9 KB
diff --git a/‎doc/_static/image_light.svg
Lines changed: 1 addition & 0 deletions b/‎doc/_static/image_light.svg
Lines changed: 1 addition & 0 deletions
diff --git a/‎doc/conf.py
Lines changed: 6 additions & 2 deletions b/‎doc/conf.py
Lines changed: 6 additions & 2 deletions
diff --git a/‎doc/getting_started.md
Lines changed: 120 additions & 0 deletions b/‎doc/getting_started.md
Lines changed: 120 additions & 0 deletions
diff --git a/‎doc/index.md
Lines changed: 6 additions & 5 deletions b/‎doc/index.md
Lines changed: 6 additions & 5 deletions
diff --git a/‎doc/inference.md
Lines changed: 198 additions & 0 deletions b/‎doc/inference.md
Lines changed: 198 additions & 0 deletions
@@ -81,10 +81,10 @@ def get_version():
     "sphinx.ext.todo",
     "sphinx.ext.viewcode",
     "nbsphinx",
-    # Use either myst_parser or myst_nb, not both
-    # "myst_parser",
     "myst_nb",
     "sphinx_design",
+    "sphinx_tabs.tabs",
+    "sphinx_copybutton"
 ]
 
 # Mock modules that might not be available during documentation build
@@ -106,6 +106,10 @@ def get_version():
 
 html_theme = "sphinx_book_theme"
 html_theme_options = {
+    "logo": {
+        "image_light": "_static/image.png",
+        "image_dark": "_static/image.png",
+    },
     "repository_url": "https://github.com/aws/sagemaker-hyperpod-cli",
     "use_repository_button": True,
     "use_issues_button": True,
 
@@ -0,0 +1,120 @@
+(getting_started)=
+
+# Getting Started
+
+This guide will help you get started with the SageMaker HyperPod CLI and SDK to perform basic operations.
+
+## Cluster Management
+
+### List Available Clusters
+
+List all available SageMaker HyperPod clusters in your account:
+
+**CLI**
+```bash
+hyp list-cluster [--region <region>] [--namespace <namespace>] [--output <json|table>]
+```
+
+**SDK**
+```python
+from sagemaker.hyperpod.hyperpod_manager import HyperPodManager
+
+clusters = HyperPodManager.list_clusters(region='us-east-2')
+print(clusters)
+```
+
+**Parameters:**
+- `region` (string) - Optional. The AWS region where the SageMaker HyperPod and EKS clusters are located. If not specified, uses the region from your current AWS account credentials.
+- `namespace` (string) - Optional. The namespace to check quota with. Only SageMaker managed namespaces are supported.
+- `output` (enum) - Optional. The output format: `table` or `json` (default).
+
+### Connect to a Cluster
+
+Configure your local kubectl environment to interact with a specific SageMaker HyperPod cluster and namespace:
+
+**CLI**
+```bash
+hyp set-cluster-context --cluster-name <cluster-name> [--namespace <namespace>]
+```
+
+**SDK**
+```python
+from sagemaker.hyperpod.hyperpod_manager import HyperPodManager
+
+HyperPodManager.set_context('<hyperpod-cluster-name>', region='us-east-2')
+```
+
+**Parameters:**
+- `cluster-name` (string) - Required. The SageMaker HyperPod cluster name to configure with.
+- `namespace` (string) - Optional. The namespace to connect to. If not specified, the CLI will automatically discover accessible namespaces.
+
+### Get Current Cluster Context
+
+View information about the currently configured cluster context:
+
+**CLI**
+```bash
+hyp get-cluster-context
+```
+
+**SDK**
+```python
+from sagemaker.hyperpod.hyperpod_manager import HyperPodManager
+
+# Get current context information
+context = HyperPodManager.get_context()
+print(context)
+```
+
+## Job Management
+
+### List Pods for a Training Job
+
+View all pods associated with a specific training job:
+
+**CLI**
+```bash
+hyp list-pods hyp-pytorch-job --job-name <job-name>
+```
+
+**SDK**
+```python
+# List all pods created for this job
+pytorch_job.list_pods()
+```
+
+**Parameters:**
+- `job-name` (string) - Required. The name of the job to list pods for.
+
+### Access Pod Logs
+
+View logs for a specific pod within a training job:
+
+**CLI**
+```bash
+hyp get-logs hyp-pytorch-job --pod-name <pod-name> --job-name <job-name>
+```
+
+**SDK**
+```python
+# Check the logs from pod0
+pytorch_job.get_logs_from_pod("demo-pod-0")
+```
+
+**Parameters:**
+- `job-name` (string) - Required. The name of the job to get logs for.
+- `pod-name` (string) - Required. The name of the pod to get logs from.
+
+## Next Steps
+
+After setting up your environment and connecting to a cluster, you can:
+
+- Create and manage PyTorch training jobs
+- Deploy and manage inference endpoints
+- Monitor cluster resources and job performance
+
+For more detailed information on specific commands, use the `--help` flag:
+
+```bash
+hyp <command> --help
+```
@@ -2,13 +2,14 @@
 
 # SageMaker HyperPod CLI and SDK Documentation
 
-**Version**: {{ version }}
-
 ```{toctree}
 :hidden:
 :maxdepth: 1
 
+Installation <installation>
 Getting Started <getting_started>
+Training <training>
+Inference <inference>
 API reference <_apidoc/modules>
 ```
 
@@ -19,7 +20,7 @@ SageMaker HyperPod CLI and SDK provide a seamless way to manage distributed trai
 :gutter: 3
 
 :::{grid-item-card} Installation
-:link: getting_started
+:link: installation
 :link-type: ref
 
 Get the CLI/ SDK setup
@@ -33,14 +34,14 @@ Beginner's guide to using CLI/ SDK
 :::
 
 :::{grid-item-card} Training
-:link: getting_started
+:link: training
 :link-type: ref
 
 Detailed guide on creating Pytorch training jobs
 :::
 
 :::{grid-item-card} Inference
-:link: getting_started
+:link: inference
 :link-type: ref
 
 Detailed guide on creating, invoking and monitoring endpoints
 
@@ -0,0 +1,198 @@
+(inference)=
+
+# Inference with SageMaker HyperPod
+
+SageMaker HyperPod provides powerful capabilities for deploying and managing inference endpoints on EKS-hosted clusters. This guide covers how to create, invoke, and manage inference endpoints using both the HyperPod CLI and SDK.
+
+## Overview
+
+SageMaker HyperPod inference endpoints allow you to:
+
+- Deploy pre-trained JumpStart models
+- Deploy custom models with your own inference code
+- Configure resource requirements for inference
+- Manage endpoint lifecycle
+- Invoke endpoints for real-time predictions
+- Monitor endpoint performance
+
+## Creating Inference Endpoints
+
+You can create inference endpoints using either JumpStart models or custom models:
+
+### JumpStart Model Endpoints
+
+**CLI**
+```bash
+hyp create hyp-jumpstart-endpoint \
+    --version 1.0 \
+    --model-id jumpstart-model-id \
+    --instance-type ml.g5.8xlarge \
+    --endpoint-name endpoint-jumpstart \
+    --tls-output-s3-uri s3://sample-bucket
+```
+
+**SDK**
+```python
+from sagemaker.hyperpod.inference import HyperPodJumpstartEndpoint
+
+# Create a JumpStart endpoint
+endpoint = HyperPodJumpstartEndpoint(
+    endpoint_name="endpoint-jumpstart",
+    model_id="jumpstart-model-id",
+    instance_type="ml.g5.8xlarge",
+    tls_output_s3_uri="s3://sample-bucket"
+)
+
+# Deploy the endpoint
+endpoint.create()
+```
+
+### Custom Model Endpoints
+
+**CLI**
+```bash
+hyp create hyp-custom-endpoint \
+    --version 1.0 \
+    --endpoint-name endpoint-custom \
+    --model-uri s3://my-bucket/model-artifacts \
+    --image 123456789012.dkr.ecr.us-west-2.amazonaws.com/my-inference-image:latest \
+    --instance-type ml.g5.8xlarge \
+    --tls-output-s3-uri s3://sample-bucket
+```
+
+**SDK**
+```python
+from sagemaker.hyperpod.inference import HyperPodCustomEndpoint
+
+# Create a custom endpoint
+endpoint = HyperPodCustomEndpoint(
+    endpoint_name="endpoint-custom",
+    model_uri="s3://my-bucket/model-artifacts",
+    image="123456789012.dkr.ecr.us-west-2.amazonaws.com/my-inference-image:latest",
+    instance_type="ml.g5.8xlarge",
+    tls_output_s3_uri="s3://sample-bucket"
+)
+
+# Deploy the endpoint
+endpoint.create()
+```
+
+## Key Parameters
+
+When creating an inference endpoint, you'll need to specify:
+
+- **endpoint-name**: Unique identifier for your endpoint
+- **model-id** (JumpStart): ID of the pre-trained JumpStart model
+- **model-uri** (Custom): S3 location of your model artifacts
+- **image** (Custom): Docker image containing your inference code
+- **instance-type**: The EC2 instance type to use
+- **tls-output-s3-uri**: S3 location to store TLS certificates
+
+## Managing Inference Endpoints
+
+### List Endpoints
+
+**CLI**
+```bash
+# List JumpStart endpoints
+hyp list hyp-jumpstart-endpoint
+
+# List custom endpoints
+hyp list hyp-custom-endpoint
+```
+
+**SDK**
+```python
+from sagemaker.hyperpod.inference import HyperPodJumpstartEndpoint, HyperPodCustomEndpoint
+
+# List JumpStart endpoints
+jumpstart_endpoints = HyperPodJumpstartEndpoint.list()
+print(jumpstart_endpoints)
+
+# List custom endpoints
+custom_endpoints = HyperPodCustomEndpoint.list()
+print(custom_endpoints)
+```
+
+### Describe an Endpoint
+
+**CLI**
+```bash
+# Describe JumpStart endpoint
+hyp describe hyp-jumpstart-endpoint --endpoint-name <endpoint-name>
+
+# Describe custom endpoint
+hyp describe hyp-custom-endpoint --endpoint-name <endpoint-name>
+```
+
+**SDK**
+```python
+from sagemaker.hyperpod.inference import HyperPodJumpstartEndpoint, HyperPodCustomEndpoint
+
+# Get JumpStart endpoint details
+jumpstart_endpoint = HyperPodJumpstartEndpoint.load(endpoint_name="endpoint-jumpstart")
+jumpstart_details = jumpstart_endpoint.describe()
+print(jumpstart_details)
+
+# Get custom endpoint details
+custom_endpoint = HyperPodCustomEndpoint.load(endpoint_name="endpoint-custom")
+custom_details = custom_endpoint.describe()
+print(custom_details)
+```
+
+### Invoke an Endpoint
+
+**CLI**
+```bash
+# Invoke custom endpoint
+hyp invoke hyp-custom-endpoint \
+    --endpoint-name <endpoint-name> \
+    --content-type "application/json" \
+    --payload '{"inputs": "What is machine learning?"}'
+```
+
+**SDK**
+```python
+from sagemaker.hyperpod.inference import HyperPodCustomEndpoint
+
+# Load the endpoint
+endpoint = HyperPodCustomEndpoint.load(endpoint_name="endpoint-custom")
+
+# Invoke the endpoint
+response = endpoint.invoke(
+    payload={"inputs": "What is machine learning?"},
+    content_type="application/json"
+)
+print(response)
+```
+
+### Delete an Endpoint
+
+**CLI**
+```bash
+# Delete JumpStart endpoint
+hyp delete hyp-jumpstart-endpoint --endpoint-name <endpoint-name>
+
+# Delete custom endpoint
+hyp delete hyp-custom-endpoint --endpoint-name <endpoint-name>
+```
+
+**SDK**
+```python
+from sagemaker.hyperpod.inference import HyperPodJumpstartEndpoint, HyperPodCustomEndpoint
+
+# Delete JumpStart endpoint
+jumpstart_endpoint = HyperPodJumpstartEndpoint.load(endpoint_name="endpoint-jumpstart")
+jumpstart_endpoint.delete()
+
+# Delete custom endpoint
+custom_endpoint = HyperPodCustomEndpoint.load(endpoint_name="endpoint-custom")
+custom_endpoint.delete()
+```
+
+## Inference Example Notebooks
+
+For detailed examples of inference with HyperPod, see:
+- [CLI Inference FSX Model Example](https://github.com/aws/sagemaker-hyperpod-cli/blob/main/examples/inference/CLI/inference-fsx-model-e2e-cli.ipynb)
+- [CLI Inference Jumpstart Model Example](https://github.com/aws/sagemaker-hyperpod-cli/blob/main/examples/inference/CLI/inference-jumpstart-e2e-cli.ipynb)
+- [CLI Inference S3 Model Example](https://github.com/aws/sagemaker-hyperpod-cli/blob/main/examples/inference/CLI/inference-s3-model-e2e-cli.ipynb)