Content Chunker
Converts MaterializedContainerSection objects into Chunk objects with intelligent text splitting.
For container sections with small total content (aggregated from leaves), creates a single chunk containing all leaf content. For large leaf sections within containers, splits them individually into multiple chunks.
Parameters
max Chunk Size
Maximum characters per chunk (default: 1500)
overlap Size
Characters of overlap between chunks (default: 200)
min Chunk Size
Minimum characters to warrant splitting (default: 2000)
Types
Link copied to clipboard
data class SplitterConfig(val maxChunkSize: Int = 1500, val overlapSize: Int = 200, val minChunkSize: Int = 2000)
Configuration for the splitter