Dataflow for a classic analysis job
To run an analysis job we first need to collect the data, either via a message queue system or via log ingestion.
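For the message-queue path, producing events into the queue might look like the minimal sketch below. It assumes a local Kafka broker, the third-party kafka-python client, and an illustrative topic name and event shape; none of these are prescribed here.

```python
# Minimal sketch: push application events into a message queue.
# Assumes a local Kafka broker and the kafka-python client; the topic name
# "app-logs" and the JSON event shape are illustrative only.
import json
from kafka import KafkaProducer

producer = KafkaProducer(
    bootstrap_servers="localhost:9092",
    # serialize each event dict as UTF-8 encoded JSON bytes
    value_serializer=lambda event: json.dumps(event).encode("utf-8"),
)

event = {"user_id": 42, "action": "click", "page": "/home"}
producer.send("app-logs", value=event)   # asynchronous send
producer.flush()                         # block until the event is delivered
```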
After getting the original data, we have two stacks for analyzing it: one is batch-based analysis, the other is streaming-based analysis.
The overall dataflow follows the outline below. The tech stack includes:
- log ingestion to Hadoop - Apache Flume
- RDBMS to Hadoop - Apache Sqoop
- message queue (mostly working as event streams) - Apache Kafka
- distributed storage - HDFS
- batch computing - Hadoop MapReduce or Spark RDD (see the batch sketch after this list)
- stream computing - Apache Storm or Spark Streaming (see the streaming sketch after this list)
- interactive querying - Spark SQL or Hive (see the query sketch after this list)
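To make the batch path concrete, here is a minimal PySpark sketch that counts events per page with the classic RDD API. The HDFS paths and the tab-separated log format are assumptions for illustration, not part of the stack above.

```python
# Minimal batch sketch: count events per page from logs already landed on HDFS
# (e.g. via Flume). The path /data/logs/ and the tab-separated layout are
# hypothetical.
from pyspark import SparkContext

sc = SparkContext(appName="batch-page-counts")

counts = (
    sc.textFile("hdfs:///data/logs/*")            # one log line per record
      .map(lambda line: line.split("\t"))          # assume tab-separated fields
      .filter(lambda fields: len(fields) >= 3)     # drop malformed lines
      .map(lambda fields: (fields[2], 1))          # key by the page field
      .reduceByKey(lambda a, b: a + b)             # aggregate per page
)

counts.saveAsTextFile("hdfs:///data/output/page_counts")
```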
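For the streaming path, a minimal sketch with the classic DStream API could look like the following. It assumes Spark 1.x/2.x with the spark-streaming-kafka integration package on the classpath (the DStream Kafka connector was removed in Spark 3, where Structured Streaming would be used instead); the broker address, topic, and log format are illustrative.

```python
# Minimal streaming sketch: per-page counts over 5-second micro-batches,
# reading events from Kafka with the classic DStream API (Spark 1.x/2.x).
from pyspark import SparkContext
from pyspark.streaming import StreamingContext
from pyspark.streaming.kafka import KafkaUtils

sc = SparkContext(appName="streaming-page-counts")
ssc = StreamingContext(sc, batchDuration=5)        # 5-second micro-batches

stream = KafkaUtils.createDirectStream(
    ssc, ["app-logs"], {"metadata.broker.list": "localhost:9092"}
)

# each record is a (key, value) pair; count events per page within each batch
page_counts = (
    stream.map(lambda kv: kv[1].split("\t"))
          .filter(lambda fields: len(fields) >= 3)
          .map(lambda fields: (fields[2], 1))
          .reduceByKey(lambda a, b: a + b)
)
page_counts.pprint()

ssc.start()
ssc.awaitTermination()
```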
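Finally, for interactive querying, a minimal Spark SQL sketch: it registers the raw logs as a temporary view and runs an ad-hoc aggregation. The column names and file layout are assumptions; in practice the data would often live in a Hive table instead.

```python
# Minimal interactive-query sketch with Spark SQL. The schema and HDFS path
# are hypothetical; a Hive table would work the same way via spark.sql().
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("interactive-queries").getOrCreate()

logs = (
    spark.read
         .option("sep", "\t")
         .schema("ts STRING, user_id STRING, page STRING")
         .csv("hdfs:///data/logs/*")
)
logs.createOrReplaceTempView("logs")

# ad-hoc SQL over the ingested data
spark.sql("""
    SELECT page, COUNT(*) AS hits
    FROM logs
    GROUP BY page
    ORDER BY hits DESC
    LIMIT 10
""").show()
```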