interface SemanticChunkingConfigurationProperty
| Language | Type name |
|---|---|
.NET | Amazon.CDK.CfnPropertyMixins.AWS.Wisdom.CfnKnowledgeBasePropsMixin.SemanticChunkingConfigurationProperty |
Go | github.com/aws/aws-cdk-go/awscdkcfnpropertymixins/v2/awswisdom#CfnKnowledgeBasePropsMixin_SemanticChunkingConfigurationProperty |
Java | software.amazon.awscdk.cfnpropertymixins.services.wisdom.CfnKnowledgeBasePropsMixin.SemanticChunkingConfigurationProperty |
Python | aws_cdk.cfn_property_mixins.aws_wisdom.CfnKnowledgeBasePropsMixin.SemanticChunkingConfigurationProperty |
TypeScript | @aws-cdk/cfn-property-mixins » aws_wisdom » CfnKnowledgeBasePropsMixin » SemanticChunkingConfigurationProperty |
Settings for semantic document chunking for a data source.
Semantic chunking splits a document into smaller documents based on groups of similar content derived from the text with natural language processing.
Example
// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import { aws_wisdom as wisdom } from '@aws-cdk/cfn-property-mixins';
const semanticChunkingConfigurationProperty: wisdom.CfnKnowledgeBasePropsMixin.SemanticChunkingConfigurationProperty = {
breakpointPercentileThreshold: 123,
bufferSize: 123,
maxTokens: 123,
};
Properties
| Name | Type | Description |
|---|---|---|
| breakpoint | number | The dissimilarity threshold for splitting chunks. |
| buffer | number | The buffer size. |
| max | number | The maximum number of tokens that a chunk can contain. |
breakpointPercentileThreshold?
Type:
number
(optional)
The dissimilarity threshold for splitting chunks.
bufferSize?
Type:
number
(optional)
The buffer size.
maxTokens?
Type:
number
(optional)
The maximum number of tokens that a chunk can contain.

.NET
Go
Java
Python
TypeScript