Used to be called “Kinesis Data Firehose”

A fully managed service that delivers streaming data to destinations:

  • AWS: Amazon Redshift / Amazon S3 / Amazon OpenSearch

  • 3rd party: Splunk / MongoDB / Datadog / New Relic

  • Custom HTTP endpoint

Key characteristics:

  • Automatic scaling, serverless, pay for what you use

  • Near real-time: records are buffered and delivered once a size or time threshold is reached

  • Supports CSV, JSON, Parquet, Avro, Raw Text, Binary data

  • Custom data transformation using Lambda
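
The Lambda transformation bullet can be illustrated with a minimal sketch of a handler. It assumes incoming records are JSON objects; the `processed` field is a hypothetical transformation chosen only for illustration. The event shape (`records`, `recordId`, base64-encoded `data`) and the `result` values follow the Firehose data transformation contract.

```python
import base64
import json


def lambda_handler(event, context):
    """Transform each record in the batch and return it to Firehose."""
    output = []
    for record in event["records"]:
        try:
            # Record payloads arrive base64-encoded.
            payload = json.loads(base64.b64decode(record["data"]))
            # Hypothetical transformation: tag the record as processed.
            payload["processed"] = True
            data = (json.dumps(payload) + "\n").encode("utf-8")
            output.append({
                "recordId": record["recordId"],
                "result": "Ok",
                "data": base64.b64encode(data).decode("utf-8"),
            })
        except (ValueError, TypeError):
            # Not a JSON object: return the record unchanged, marked as failed.
            output.append({
                "recordId": record["recordId"],
                "result": "ProcessingFailed",
                "data": record["data"],
            })
    return {"records": output}
```

Every incoming `recordId` must appear in the response. Returning `"Dropped"` as the `result` is the way to filter a record out on purpose.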

Kinesis Data Streams vs Amazon Data Firehose

Kinesis Data Streams          | Amazon Data Firehose
Streaming data collection     | Load streaming data into S3, Redshift, 3rd party …
Producer & consumer code      | Fully managed
Real-time                     | Near real-time
Provisioned / On-demand mode  | Automatic scaling
Data storage up to 365 days   | No data storage
Replay capability             | No replay
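
The "Near real-time" entry follows from buffering: records wait until a size or time threshold is reached, then go out as one batch. The sketch below is a toy model of that idea, not the Firehose implementation. The class name, the thresholds, and the explicit `now` argument (used instead of a real clock so the example is deterministic) are all assumptions made for illustration.

```python
class BufferedDelivery:
    """Toy model of size/time-based buffering (not the Firehose implementation)."""

    def __init__(self, max_bytes, max_seconds):
        self.max_bytes = max_bytes
        self.max_seconds = max_seconds
        self.buffer = []
        self.buffered_bytes = 0
        self.opened_at = None  # arrival time of the oldest buffered record
        self.batches = []      # stands in for objects written to the destination

    def put(self, record, now):
        # Flush first if the oldest buffered record has waited long enough.
        self.tick(now)
        if not self.buffer:
            self.opened_at = now
        self.buffer.append(record)
        self.buffered_bytes += len(record)
        if self.buffered_bytes >= self.max_bytes:
            self.flush()  # size-based flush

    def tick(self, now):
        if self.buffer and now - self.opened_at >= self.max_seconds:
            self.flush()  # time-based flush

    def flush(self):
        self.batches.append(b"".join(self.buffer))
        self.buffer, self.buffered_bytes, self.opened_at = [], 0, None


delivery = BufferedDelivery(max_bytes=10, max_seconds=60)
delivery.put(b"aaaa", now=0)
delivery.put(b"bbbb", now=1)  # 8 bytes buffered, nothing delivered yet
delivery.put(b"cccc", now=2)  # 12 bytes >= 10: size-based flush
delivery.put(b"dd", now=3)
delivery.tick(now=70)         # 67 s since "dd" arrived: time-based flush
print(delivery.batches)
```

Lower thresholds reduce delivery latency at the cost of more, smaller batches at the destination. A consumer reading directly from a stream does not have this wait.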