Explain the level of parallelism in spark streaming.

This topic has 1 reply, 1 voice, and was last updated 7 years, 10 months ago by DataFlair Team.

Viewing 1 reply thread

Author

Posts
- September 20, 2018 at 3:01 pm #5390
  
  DataFlair Team
  Spectator
  
  Explain the level of parallelism in spark streaming.
- September 20, 2018 at 3:01 pm #5391
  
  DataFlair Team
  Spectator
  
  > In order to reduce the processing time, one need to increase the parallelism.
  > In Spark Streaming, there are three ways to increase the parallelism :
  (1) Increase the number of receivers : If there are too many records for single receiver (single machine) to read in and distribute so that is bottleneck. So we can increase the no. of receiver depends on scenario.
  (2) Re-partition the receive data : If one is not in a position to increase the no. of receivers in that case redistribute the data by re-partitioning.
  (3) Increase parallelism in aggregation :
  
  for complete guide on Spark Streaming you may refer to Apache Spark-Streaming guide
Author

Posts

Viewing 1 reply thread

You must be logged in to reply to this topic.