Strive for the Best: Reproducible Stream Processing Benchmark to compare Apache Spark and Apache Flink on Cloud

Thursday, February 4, 2016

Reproducible Stream Processing Benchmark to compare Apache Spark and Apache Flink on Cloud

Stream processing is becoming a crucial requirement with the high volume of data generated and the need for real-time processing of those data. And the data processing platforms are trying to provide smart and efficient approaches for stream processing.

Yahoo has recently published a stream processing benchmark and has published the resources here to run the experiment in a single node. Since benchmarking stream processing is an interesting and important task, we wanted to reproduce the experiment on a clustered setup.

We created a reproducible experiment on Amazon EC2 to reproduce yahoo streaming benchmark on a cluster of Apache Spark and Apache Flink. You can find all the resources and in instruction here which will help you to reproduce our experiment. And more importantly, you can reproduce the experiment with different configurations conveniently by following the instruction in above-mentioned link.

Following are some of the application-level and system-level performance results that we obtained during the experiment.

Application-level performance:

System-level performance

Memory:

CPU:

The configurations and explanation of the results can be found in the stream processing evaluation section of full report.

We have completed a performance comparison of batch processing as well for Apache Spark and Apache Flink to reproduce DongWong’s performance comparison.

The full project report can be found here.

Acknowledgement

  Jim Dowling
  Kamal Hakinzaheh
  Shelan Perera

12 comments:

Street viewOctober 4, 2018 at 9:22 PM
Wonderful post !!! Genuinely loved this kind of post.
- Lenny Face
ReplyDelete
Replies
rohitDecember 19, 2018 at 1:47 AM
great post thanks for sharing
KissAnime
ReplyDelete
Replies
UnknownFebruary 17, 2019 at 7:14 AM
Video streaming is likewise represented by different conventions that bring under thought specialized execution, quality issues, dependability, cost elements, and lawful and social issues.best iptv service 2019
ReplyDelete
Replies
marksonDecember 9, 2019 at 5:52 AM
Prescient information investigation utilizes this data to channel current just as verifiable information to recognize examples and figure occasions and conditions that may happen later on at a specific time, given gave parameters.
Data Analytics Course in Bangalore
ReplyDelete
Replies
rama venkataFebruary 24, 2020 at 3:21 AM
nice article!! try to create an article on digital marketing and post. that would also help people in doing the course.and one institute is good in offering certified courses that is 360DigiTMG.if u want u can refer that institute.
ReplyDelete
Replies
AnanadJuly 14, 2020 at 6:13 AM
This comment has been removed by the author.
ReplyDelete
Replies
shezankhatriJuly 30, 2020 at 11:18 AM
Easily, the article is actually the best topic on this registry related issue. I fit in with your conclusions and will eagerly look forward to your next updates. gostream
ReplyDelete
Replies
Cho co October 25, 2020 at 10:56 AM
This is not a position of power or a title, but a role for somebody who has the right mindset, attitude and ambition to change things, and who wants to make a difference and to set new standards. Salesforce training in Hyderabad
ReplyDelete
Replies
Ramesh SampangiApril 30, 2021 at 5:15 AM
Really nice article. I appreciated your effort in this article. I bookmarked your website for further posts. Keep sharing!
Data Science Course in Hyderabad
ReplyDelete
Replies
360DigiTMGMarch 14, 2022 at 12:22 AM
wow, great, I was wondering how to cure acne naturally. I found your site on Google, learned a lot, and now I'm a bit clearer. I’ve bookmarked your site and also added rss. keep us updated.
data science course in hyderabad
ReplyDelete
Replies
AnonymousMay 31, 2022 at 11:55 AM
smm panel
Smm Panel
HTTPS://İSİLANLARİBLOG.COM/
instagram takipçi satın al
HİRDAVATCİ BURADA
BEYAZESYATEKNİKSERVİSİ.COM.TR
SERVİS
Tiktok Para Hilesi
ReplyDelete
Replies
AnonymousJune 2, 2022 at 9:16 AM
kartal samsung klima servisi
beykoz arçelik klima servisi
üsküdar arçelik klima servisi
tuzla vestel klima servisi
tuzla bosch klima servisi
tuzla arçelik klima servisi
çekmeköy samsung klima servisi
ataşehir samsung klima servisi
çekmeköy mitsubishi klima servisi
ReplyDelete
Replies

Subscribe to: Post Comments (Atom)