EndpointConfig

sagemaker.services.k8s.aws/v1alpha1

TypeLink
GoDocsagemaker-controller/apis/v1alpha1#EndpointConfig

Metadata

PropertyValue
ScopeNamespaced
KindEndpointConfig
ListKindEndpointConfigList
Pluralendpointconfigs
Singularendpointconfig

Spec

asyncInferenceConfig: 
  clientConfig: 
    maxConcurrentInvocationsPerInstance: integer
  outputConfig: 
    kmsKeyID: string
    notificationConfig: 
      errorTopic: string
      successTopic: string
    s3OutputPath: string
dataCaptureConfig: 
  captureContentTypeHeader: 
    csvContentTypes:
    - string
    jsonContentTypes:
    - string
  captureOptions:
  - captureMode: string
  destinationS3URI: string
  enableCapture: boolean
  initialSamplingPercentage: integer
  kmsKeyID: string
endpointConfigName: string
kmsKeyID: string
productionVariants:
- acceleratorType: string
  containerStartupHealthCheckTimeoutInSeconds: integer
  coreDumpConfig: 
    destinationS3URI: string
    kmsKeyID: string
  enableSSMAccess: boolean
  initialInstanceCount: integer
  initialVariantWeight: number
  instanceType: string
  modelDataDownloadTimeoutInSeconds: integer
  modelName: string
  serverlessConfig: 
    maxConcurrency: integer
    memorySizeInMB: integer
  variantName: string
  volumeSizeInGB: integer
tags:
- key: string
  value: string
FieldDescription
asyncInferenceConfig
Optional
object
Specifies configuration for how an endpoint performs asynchronous inference. This is a required field in order for your Endpoint to be invoked using InvokeEndpointAsync (https://docs.aws.amazon.com/sagemaker/latest/APIReference/API_runtime_InvokeEndpointAsync.html).
asyncInferenceConfig.clientConfig
Optional
object
Configures the behavior of the client used by SageMaker to interact with the model container during asynchronous inference.
asyncInferenceConfig.clientConfig.maxConcurrentInvocationsPerInstance
Optional
integer
asyncInferenceConfig.outputConfig
Optional
object
Specifies the configuration for asynchronous inference invocation outputs.
asyncInferenceConfig.outputConfig.kmsKeyID
Optional
string
asyncInferenceConfig.outputConfig.notificationConfig
Optional
object
Specifies the configuration for notifications of inference results for asynchronous inference.
asyncInferenceConfig.outputConfig.notificationConfig.errorTopic
Optional
string
asyncInferenceConfig.outputConfig.notificationConfig.successTopic
Optional
string
asyncInferenceConfig.outputConfig.s3OutputPath
Optional
string
dataCaptureConfig
Optional
object
Configuration to control how SageMaker captures inference data.
dataCaptureConfig.captureContentTypeHeader
Optional
object
Configuration specifying how to treat different headers. If no headers are specified SageMaker will by default base64 encode when capturing the data.
dataCaptureConfig.captureContentTypeHeader.csvContentTypes
Optional
array
dataCaptureConfig.captureContentTypeHeader.csvContentTypes.[]
Required
string
dataCaptureConfig.captureContentTypeHeader.jsonContentTypes.[]
Required
string
dataCaptureConfig.captureOptions.[]
Required
object
Specifies data Model Monitor will capture.
dataCaptureConfig.destinationS3URI
Optional
string
dataCaptureConfig.enableCapture
Optional
boolean
dataCaptureConfig.initialSamplingPercentage
Optional
integer
dataCaptureConfig.kmsKeyID
Optional
string
endpointConfigName
Required
string
The name of the endpoint configuration. You specify this name in a CreateEndpoint request.
kmsKeyID
Optional
string
The Amazon Resource Name (ARN) of a Amazon Web Services Key Management Service key that SageMaker uses to encrypt data on the storage volume attached to the ML compute instance that hosts the endpoint.
The KmsKeyId can be any of the following formats:
* Key ID: 1234abcd-12ab-34cd-56ef-1234567890ab
* Key ARN: arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab
* Alias name: alias/ExampleAlias
* Alias name ARN: arn:aws:kms:us-west-2:111122223333:alias/ExampleAlias
The KMS key policy must grant permission to the IAM role that you specify in your CreateEndpoint, UpdateEndpoint requests. For more information, refer to the Amazon Web Services Key Management Service section Using Key Policies in Amazon Web Services KMS (https://docs.aws.amazon.com/kms/latest/developerguide/key-policies.html)
Certain Nitro-based instances include local storage, dependent on the instance type. Local storage volumes are encrypted using a hardware module on the instance. You can’t request a KmsKeyId when using an instance type with local storage. If any of the models that you specify in the ProductionVariants parameter use nitro-based instances with local storage, do not specify a value for the KmsKeyId parameter. If you specify a value for KmsKeyId when using any nitro-based instances with local storage, the call to CreateEndpointConfig fails.
For a list of instance types that support local instance storage, see Instance Store Volumes (https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/InstanceStorage.html#instance-store-volumes).
For more information about local instance storage encryption, see SSD Instance Store Volumes (https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ssd-instance-store.html).
productionVariants
Required
array
An array of ProductionVariant objects, one for each model that you want to host at this endpoint.
productionVariants.[]
Required
object
Identifies a model that you want to host and the resources chosen to deploy for hosting it. If you are deploying multiple models, tell SageMaker how to distribute traffic among the models by specifying variant weights. For more information on production variants, check Production variants (https://docs.aws.amazon.com/sagemaker/latest/dg/model-ab-testing.html).
productionVariants.[].containerStartupHealthCheckTimeoutInSeconds
Optional
integer
productionVariants.[].coreDumpConfig
Optional
object
Specifies configuration for a core dump from the model container when the process crashes.
productionVariants.[].coreDumpConfig.destinationS3URI
Optional
string
productionVariants.[].coreDumpConfig.kmsKeyID
Optional
string
productionVariants.[].enableSSMAccess
Optional
boolean
productionVariants.[].initialInstanceCount
Optional
integer
productionVariants.[].initialVariantWeight
Optional
number
productionVariants.[].instanceType
Optional
string
productionVariants.[].modelDataDownloadTimeoutInSeconds
Optional
integer
productionVariants.[].modelName
Optional
string
productionVariants.[].serverlessConfig
Optional
object
Specifies the serverless configuration for an endpoint variant.
productionVariants.[].serverlessConfig.maxConcurrency
Optional
integer
productionVariants.[].serverlessConfig.memorySizeInMB
Optional
integer
productionVariants.[].variantName
Optional
string
productionVariants.[].volumeSizeInGB
Optional
integer
tags
Optional
array
An array of key-value pairs. You can use tags to categorize your Amazon Web Services resources in different ways, for example, by purpose, owner, or environment. For more information, see Tagging Amazon Web Services Resources (https://docs.aws.amazon.com/general/latest/gr/aws_tagging.html).
tags.[]
Required
object
A tag object that consists of a key and an optional value, used to manage metadata for SageMaker Amazon Web Services resources.
You can add tags to notebook instances, training jobs, hyperparameter tuning jobs, batch transform jobs, models, labeling jobs, work teams, endpoint configurations, and endpoints. For more information on adding tags to SageMaker resources, see AddTags.
For more information on adding metadata to your Amazon Web Services resources with tagging, see Tagging Amazon Web Services resources (https://docs.aws.amazon.com/general/latest/gr/aws_tagging.html). For advice on best practices for managing Amazon Web Services resources with tagging, see Tagging Best Practices: Implement an Effective Amazon Web Services Resource Tagging Strategy (https://d1.awsstatic.com/whitepapers/aws-tagging-best-practices.pdf).
tags.[].value
Optional
string

Status

ackResourceMetadata: 
  arn: string
  ownerAccountID: string
  region: string
conditions:
- lastTransitionTime: string
  message: string
  reason: string
  status: string
  type: string
FieldDescription
ackResourceMetadata
Optional
object
All CRs managed by ACK have a common Status.ACKResourceMetadata member that is used to contain resource sync state, account ownership, constructed ARN for the resource
ackResourceMetadata.arn
Optional
string
ARN is the Amazon Resource Name for the resource. This is a globally-unique identifier and is set only by the ACK service controller once the controller has orchestrated the creation of the resource OR when it has verified that an “adopted” resource (a resource where the ARN annotation was set by the Kubernetes user on the CR) exists and matches the supplied CR’s Spec field values. TODO(vijat@): Find a better strategy for resources that do not have ARN in CreateOutputResponse https://github.com/aws/aws-controllers-k8s/issues/270
ackResourceMetadata.ownerAccountID
Required
string
OwnerAccountID is the AWS Account ID of the account that owns the backend AWS service API resource.
ackResourceMetadata.region
Required
string
Region is the AWS region in which the resource exists or will exist.
conditions
Optional
array
All CRS managed by ACK have a common Status.Conditions member that contains a collection of ackv1alpha1.Condition objects that describe the various terminal states of the CR and its backend AWS service API resource
conditions.[]
Required
object
Condition is the common struct used by all CRDs managed by ACK service controllers to indicate terminal states of the CR and its backend AWS service API resource
conditions.[].message
Optional
string
A human readable message indicating details about the transition.
conditions.[].reason
Optional
string
The reason for the condition’s last transition.
conditions.[].status
Optional
string
Status of the condition, one of True, False, Unknown.
conditions.[].type
Optional
string
Type is the type of the Condition