Multimodal
Media exchanged with the model — images, audio, voice, transcripts, vision input.
ImageMessage
Image bubble for chat replies with loading, generation progress, error, and ready states.
ImageGallery
Responsive image grid with built-in lightbox, keyboard navigation, and download.
VoiceWaveform
Bar-based audio waveform: live recording, playback progress, or idle baseline.
AudioPlayer
Compact player for TTS / transcribed audio — play, seek, mute, cycle speed, download.
TranscriptionView
Time-anchored transcript with active-segment highlight, auto-scroll, click-to-seek.
VisionInput
Vision-input dropzone — drag-and-drop, file picker, paste, optional camera capture.
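
The active-segment highlight in TranscriptionView depends on mapping the current playback time to a transcript segment. Below is a minimal TypeScript sketch of that lookup, assuming segments are sorted by start time and do not overlap. The `TranscriptSegment` shape and the `findActiveSegment` helper are hypothetical names used for illustration, not the component's actual API.

```typescript
// Hypothetical segment shape; the real TranscriptionView props may differ.
interface TranscriptSegment {
  start: number; // seconds, inclusive
  end: number;   // seconds, exclusive
  text: string;
}

// Binary search for the index of the segment that contains `time`.
// Returns -1 when `time` falls in a gap between segments or outside the transcript.
// Assumes `segments` is sorted by `start` and segments do not overlap.
function findActiveSegment(
  segments: readonly TranscriptSegment[],
  time: number,
): number {
  let lo = 0;
  let hi = segments.length - 1;
  while (lo <= hi) {
    const mid = (lo + hi) >> 1;
    const seg = segments[mid];
    if (time < seg.start) {
      hi = mid - 1; // target is earlier than this segment
    } else if (time >= seg.end) {
      lo = mid + 1; // target is at or past the end of this segment
    } else {
      return mid; // start <= time < end
    }
  }
  return -1;
}

// Illustrative data only.
const demoSegments: TranscriptSegment[] = [
  { start: 0, end: 2.5, text: "Hello." },
  { start: 2.5, end: 6, text: "This is a transcript." },
  { start: 7, end: 9, text: "It has a gap before this line." },
];

console.log(findActiveSegment(demoSegments, 3));   // 1
console.log(findActiveSegment(demoSegments, 6.5)); // -1 (gap between segments)
```

A component could call such a helper on each playback time update and highlight or scroll to the returned index. Click-to-seek is the inverse: set the player's position to the clicked segment's `start`.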