Type discriminator for identifying the specific content modality.
Distinguishes between content types such as "text", "audio", "image", and "file", so that consumers can narrow a value to its concrete type and process each kind of content according to its own requirements.
The multimodal content processing pipeline relies on this field to route each content item to the appropriate handler while preserving type safety throughout the conversation flow.
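As a rough illustration of how a discriminator field can drive routing, the sketch below models two content types as dataclasses with a literal `type` tag. The class names, the `url`, `format`, and `text` field names, and the `route` function are assumptions made for this example, not names taken from the actual schema.

```python
from dataclasses import dataclass
from typing import Literal, Union


@dataclass
class TextContent:
    text: str  # hypothetical field name
    type: Literal["text"] = "text"


@dataclass
class AudioContent:
    url: str     # hypothetical field name
    format: str  # hypothetical field name
    type: Literal["audio"] = "audio"


# Union of the content types handled in this sketch; a real schema
# would also cover "image" and "file".
Content = Union[TextContent, AudioContent]


def route(content: Content) -> str:
    """Dispatch on the discriminator and describe the chosen handler."""
    # Comparing the literal tag lets a static type checker that supports
    # literal discriminators narrow `content` to the matching class.
    if content.type == "text":
        return f"text handler: {len(content.text)} characters"
    if content.type == "audio":
        return f"audio handler: {content.format} from {content.url}"
    raise ValueError(f"unsupported content type: {content.type!r}")
```

Inside each branch only the fields of the matching class are accessed, which is the practical benefit of narrowing on the `type` field.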
URL or data URL from which the audio bytes can be read.
Audio codec/container format used for the voice input.
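Because the audio source may be either a regular URL or a data URL, a consumer typically needs to handle both cases when loading the bytes. The following is a minimal sketch using only the Python standard library; the function name `read_audio_bytes` is hypothetical, and the code assumes data URLs follow the usual `data:[<mediatype>][;base64],<data>` layout.

```python
import base64
from urllib.parse import unquote_to_bytes
from urllib.request import urlopen


def read_audio_bytes(url: str) -> bytes:
    """Return the raw audio bytes referenced by a URL or data URL."""
    if url.startswith("data:"):
        # Split "data:<mediatype>[;base64]" from the payload at the first comma.
        header, sep, payload = url.partition(",")
        if not sep:
            raise ValueError("malformed data URL: missing ',' separator")
        if header.lower().endswith(";base64"):
            return base64.b64decode(payload)
        # Without the base64 marker, the payload is percent-encoded.
        return unquote_to_bytes(payload)
    # Any other URL is fetched over the network (assumption: the scheme
    # is one that urllib supports, such as http or https).
    with urlopen(url) as response:
        return response.read()
```

For example, `read_audio_bytes("data:audio/wav;base64,UklGRg==")` decodes the inline payload without any network access.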