Hemant Gupta · Insights from paper — Twitter Heron: Stream Processing at Scale · 9 min read · Aug 16, 2023
Hemant Gupta · Insights from paper — Google FlumeJava: Easy, Efficient Data-Parallel Pipelines · 10 min read · Aug 11, 2023
Hemant Gupta · Insights from papers — Google MapReduce: Simplified Data Processing on Large Clusters · 10 min read · Aug 10, 2023
Hemant Gupta · Insights from paper (part II) — Manu: A Cloud Native Vector Database Management System · In the previous post on the Manu paper, we discussed the basics and architecture of the system. Make sure to go through that before continue… · 9 min read · Aug 8, 2023
Hemant Gupta · Insights from paper (part I) — Manu: A Cloud Native Vector Database Management System · Data science and AI applications need to manage high-dimensional vector data. Embedding vectors are widely used for analyzing and… · 15 min read · Aug 7, 2023
Hemant Gupta · Insights from paper (part II) — Google Mesa: Geo Replicated, Near Real-Time, Scalable Data… · In the previous post, we learned a few basic concepts of the Google Mesa data warehousing system. · 8 min read · Aug 6, 2023
Hemant Gupta · Insights from paper (part I) — Google Mesa: Geo Replicated, Near Real-Time, Scalable Data… · Mesa is Google’s highly scalable analytic data warehousing system. Mesa is designed for near real-time data ingestion and querying. · 9 min read · Aug 5, 2023
Hemant Gupta · Insights from paper — Pinot: Realtime OLAP for 530 Million Users · Pinot started inside LinkedIn as an internal project in 2013. Pinot serves tens of thousands of analytical queries per second, offers… · 13 min read · Aug 4, 2023
Hemant Gupta · Insights from paper (part II) — Druid: A Real-time Analytical Data Store · In the first part of the article, I covered the introduction and architecture of the Druid paper. I will cover the rest of the paper in… · 8 min read · Aug 3, 2023