Learn how Responsive replaced embedded RocksDB with ScyllaDB in Kafka Streams, simplifying the architecture and unlocking massive availability and scale. The talk covers unbundling stream processors, key ScyllaDB features tested, and lessons learned from the transition.
ScyllaDB is No Longer "Just a Faster Cassandra" by Felipe Cardeneti MendesScyllaDB
?
Predictable performance isn't just a feature¡ªit's a must for anyone building serious, data-intensive applications at scale. ScyllaDB has often been labeled merely as a faster alternative to Cassandra, but is that all there is to it? In this talk, let's discuss the results of our head-to-head comparison between the long-delayed Apache Cassandra 5.0 release versus ScyllaDB and explain what truly sets ScyllaDB apart beyond just speed.
Efficient Deduplication in ReSys at Scale: How We Managed to Reduce Server Co...ScyllaDB
?
Efficient deduplication is key to delivering relevant recommendations at scale. Andrei & Ashish will share how they optimized for 300M+ users, cutting costs by 90% while reducing latency. They cover how ScyllaDB, Apache Flink, Redpanda & Google Dataflow powered this evolution.
Deploying ScyllaDB as an On-Premise Replacement to DynamoDB by Chuck WilliamsScyllaDB
?
Beckman Coulter Life Sciences needed to move an AWS-based app on-prem for compliance. To replace DynamoDB, they chose ScyllaDB¡ªenabling a seamless switch with no code changes using the AWS Java SDK. This session covers the proof-of-concept that made it possible.
Scaling Cron at Slack by Claire Adams, SlackScyllaDB
?
Scaling cron at Slack: From a single node to a hyperscale distributed system. Learn how Slack evolved their infrastructure to handle high-volume workloads while ensuring reliability and maintainability.
Revolutionizing Sleep: Scaling IoT Telemetry to 30+ Billion Daily Events by D...ScyllaDB
?
Sleep Number processes 30B+ sensor events daily to enhance sleep technology. This talk details a journey in scaling an IoT telemetry pipeline to handle exponential data growth, real-time processing, and high reliability¡ªadvancing sleep science for millions.
There¡¯s a common adage that it takes 10 years to develop a file system. As ScyllaDB reaches that 10 year milestone in 2025, it¡¯s the perfect time to reflect on the last decade of ScyllaDB development ¨C both hits and misses. It¡¯s especially appropriate given that our project just reached a critical mass with certain scalability and elasticity goals that we dreamed up years ago. This talk will cover how we arrived at ScyllaDB X Cloud achieving our initial vision, and share where we¡¯re heading next.
Feature Store Evolution Under Cost Constraints: When Cost is Part of the Arch...ScyllaDB
?
At P99 CONF 23, ShareChat tackled scaling its ML Feature Store to 1B features/sec¡ªthen had to cut costs while maintaining SLAs. Ivan & David share how they optimized compute, reduced waste in Kubernetes, and tackled autoscaling for Apache Flink. It's geared for anyone interested in ML Feature Store and/or cloud cost optimizations.
Read- and Write-Optimization in Modern Database Infrastructures by Dzejla Med...ScyllaDB
?
Learn more about the data structures behind large databases. Explore B-trees, B^eps-trees, and LSM-trees¡ªhow they optimize reads and writes, their tradeoffs, and how they work well under different scenarios.
Securely Serving Millions of Boot Artifacts a Day by Joa?o Pedro Lima & Matt ...ScyllaDB
?
Cloudflare¡¯s boot infrastructure dynamically generates and signs boot artifacts for nodes worldwide, ensuring secure, scalable, and customizable deployments. This talk dives into its architecture, scaling decisions, and how it enables seamless testing while maintaining a strong chain of trust.
Gmetrics: Processing Metrics at Uber Scale by Cristian VelazquezScyllaDB
?
Gmetrics is the new library that Uber uses to process metrics that addresses many of the pain points around processing metrics as scale for Uber. Cristian provides an inside look at the migration, which involved rewriting everything for Java and Go. He also shares how they are doing this migration safely and transparently for users.
30B Images and Counting: Scaling Canva's Content-Understanding Pipelines by K...ScyllaDB
?
Scaling content understanding for billions of images is no easy feat. This talk dives into building extreme label classification models, balancing accuracy & speed, and optimizing ML pipelines for scale. You'll learn new ways to tackle real-time performance challenges in massive data environments.
Evolving Atlassian Confluence Cloud for Scale, Reliability, and Performance b...ScyllaDB
?
Explore how Confluence Cloud scaled to handle billions of requests while ensuring performance & reliability. Learn about microservice sharding, dependency scaling, failure isolation, and optimizing metrics for real customer impact.
Route It Like It¡¯s Hot: Scaling Payments Routing at American Express by Benja...ScyllaDB
?
Join the American Express Payment Acquiring and Network team as they share insights on building their Global Transaction Router, which powers payment routing at Amex scale. Learn how they design, build, and operate it to handle record-breaking shopping days, ticket sales, and unpredictable demand.
ScyllaDB¡¯s Monstrous Engineering Advances by Avi KivityScyllaDB
?
Join ScyllaDB CTO Avi as he highlights the engineering milestones achieved over the past year, diving into new features, performance improvements, and innovations. Plus, get an exclusive sneak peek into what's coming next for ScyllaDB and the future of high-performance databases.
The Future of Repair: Transparent and Incremental by Botond De?nesScyllaDB
?
Regularly run repairs are essential to keep clusters healthy, yet having a good repair schedule is more challenging than it should be. Repairs often take a long time, preventing running them often. This has an impact on data consistency and also limits the usefulness of the new repair based tombstone garbage collection. We want to address these challenges by making repairs incremental and allowing for automatic repair scheduling, without relying on external tools.
How We Boosted ScyllaDB Data Streaming by 25x by Asias HeScyllaDB
?
Streaming, the process of scaling out/in to other nodes used to analyze every partition, one-by-one and was too slow and depended on the schema. File based stream is a new feature that optimizes tablet movement significantly. It streams the entire SSTable files without deserializing SSTable files into mutation fragments and re-serializing them back into SSTables on receiving nodes. As a result, less data is streamed over the network, and less CPU is consumed, especially for data models that contain small cells.
How Discord Indexes Trillions of Messages: Scaling Search Infrastructure by V...ScyllaDB
?
This talk shares how Discord scaled their message search infrastructure using Rust, Kubernetes, and a multi-cluster Elasticsearch architecture to achieve better performance, operability, and reliability, while also enabling new search features for Discord users.
Inside Freshworks' Migration from Cassandra to ScyllaDB by Premkumar PatturajScyllaDB
?
Freshworks migrated from Cassandra to ScyllaDB to handle growing audit log data efficiently. Cassandra required frequent scaling, complex repairs, and had non-linear scaling. ScyllaDB reduced costs with fewer machines and improved operations. Using Zero Downtime Migration (ZDM), they bulk-migrated data, performed dual writes, and validated consistency.
Data Structures Handling Trillions of Daily Streaming Events by Evan ChanScyllaDB
?
Conviva built a high-performance streaming analytics pipeline processing trillions of events daily¡ªthen made it programmable for diverse workloads. This talk covers how they boosted throughput & memory efficiency 5x+ with Rust, zero-copy data, better execution models & smarter data structures.
Building a Scalable Event-Driven Architecture for Open Finance Brasil by Thi...ScyllaDB
?
Learn about PCM's journey to an event-driven, serverless architecture that empowers Open Finance Brasil to process billion daily reports with cost efficiency using AWS SQS, DynamoDB, S3, Firehose, ElasticCache, EKS with KEDA and Karpenter, Node.js, and Nest.js.
How Discord Performs Database Upgrades at Scale by Ethan DonowitzScyllaDB
?
Discord relies on ScyllaDB to serve millions of reads per second across many clusters, so they needed a comprehensive strategy to sufficiently de-risk upgrades to avoid impact to our users. To accomplish this, they use what they call ¡°shadow clusters.¡± This talk explains how testing with shadow clusters has been paramount to de-risking complicated upgrades for one of the most important pieces of infrastructure at Discord.
Telemetry Showdown: Fluent Bit vs. OpenTelemetry Collector by Henrik RexedScyllaDB
?
Fluent Bit or OpenTelemetry Collector¡ªwhich performs better? With OpenTelemetry unifying observability, Fluent Bit has expanded beyond logs. This session explores key differences, benchmark results, and insights to help you choose the best agent for your cloud-native stack.
Pushing Your Streaming Platform to the Limit by Elad LeevScyllaDB
?
Join Elad for a hands-on session on Chaos Engineering for streaming platforms like Kafka, Pulsar, NATS, and RabbitMQ. Learn to stress test, benchmark, and fine-tune performance to ensure your system stays resilient under pressure.
Overcome Redis Cluster Scale Bottlenecks with ScyllaDB & EloqKV by Hubert ZhangScyllaDB
?
Introducing EloqKV¡ªa new project built on ScyllaDB! It delivers sub-millisecond read/write performance, full transactions, and a Redis-compatible API, acting as a fast, consistent cache. Scale beyond Redis Cluster limits with ScyllaDB¡¯s massive scalability and resilience.
Caching for Performance Masterclass: Caching at ScaleScyllaDB
?
Weighing caching considerations for use cases with different technical requirements and growth expectations.
- Request coalescing
- Negative sharding
- Rate limiting
- Sharding and scaling
Formal Methods: Whence and Whither? [Martin Fr?nzle Festkolloquium, 2025]Jonathan Bowen
?
Alan Turing arguably wrote the first paper on formal methods 75 years ago. Since then, there have been claims and counterclaims about formal methods. Tool development has been slow but aided by Moore¡¯s Law with the increasing power of computers. Although formal methods are not widespread in practical usage at a heavyweight level, their influence as crept into software engineering practice to the extent that they are no longer necessarily called formal methods in their use. In addition, in areas where safety and security are important, with the increasing use of computers in such applications, formal methods are a viable way to improve the reliability of such software-based systems. Their use in hardware where a mistake can be very costly is also important. This talk explores the journey of formal methods to the present day and speculates on future directions.
? ????? ??????? ????? ?
???????? ??????????? is proud to be a part of the ?????? ????? ???? ???? ??????? (?????) success story! By delivering seamless, secure, and high-speed connectivity, OSWAN has revolutionized e-?????????? ?? ??????, enabling efficient communication between government departments and enhancing citizen services.
Through our innovative solutions, ???????? ?????????? has contributed to making governance smarter, faster, and more transparent. This milestone reflects our commitment to driving digital transformation and empowering communities.
? ?????????? ??????, ?????????? ??????????!
Securely Serving Millions of Boot Artifacts a Day by Joa?o Pedro Lima & Matt ...ScyllaDB
?
Cloudflare¡¯s boot infrastructure dynamically generates and signs boot artifacts for nodes worldwide, ensuring secure, scalable, and customizable deployments. This talk dives into its architecture, scaling decisions, and how it enables seamless testing while maintaining a strong chain of trust.
Gmetrics: Processing Metrics at Uber Scale by Cristian VelazquezScyllaDB
?
Gmetrics is the new library that Uber uses to process metrics that addresses many of the pain points around processing metrics as scale for Uber. Cristian provides an inside look at the migration, which involved rewriting everything for Java and Go. He also shares how they are doing this migration safely and transparently for users.
30B Images and Counting: Scaling Canva's Content-Understanding Pipelines by K...ScyllaDB
?
Scaling content understanding for billions of images is no easy feat. This talk dives into building extreme label classification models, balancing accuracy & speed, and optimizing ML pipelines for scale. You'll learn new ways to tackle real-time performance challenges in massive data environments.
Evolving Atlassian Confluence Cloud for Scale, Reliability, and Performance b...ScyllaDB
?
Explore how Confluence Cloud scaled to handle billions of requests while ensuring performance & reliability. Learn about microservice sharding, dependency scaling, failure isolation, and optimizing metrics for real customer impact.
Route It Like It¡¯s Hot: Scaling Payments Routing at American Express by Benja...ScyllaDB
?
Join the American Express Payment Acquiring and Network team as they share insights on building their Global Transaction Router, which powers payment routing at Amex scale. Learn how they design, build, and operate it to handle record-breaking shopping days, ticket sales, and unpredictable demand.
ScyllaDB¡¯s Monstrous Engineering Advances by Avi KivityScyllaDB
?
Join ScyllaDB CTO Avi as he highlights the engineering milestones achieved over the past year, diving into new features, performance improvements, and innovations. Plus, get an exclusive sneak peek into what's coming next for ScyllaDB and the future of high-performance databases.
The Future of Repair: Transparent and Incremental by Botond De?nesScyllaDB
?
Regularly run repairs are essential to keep clusters healthy, yet having a good repair schedule is more challenging than it should be. Repairs often take a long time, preventing running them often. This has an impact on data consistency and also limits the usefulness of the new repair based tombstone garbage collection. We want to address these challenges by making repairs incremental and allowing for automatic repair scheduling, without relying on external tools.
How We Boosted ScyllaDB Data Streaming by 25x by Asias HeScyllaDB
?
Streaming, the process of scaling out/in to other nodes used to analyze every partition, one-by-one and was too slow and depended on the schema. File based stream is a new feature that optimizes tablet movement significantly. It streams the entire SSTable files without deserializing SSTable files into mutation fragments and re-serializing them back into SSTables on receiving nodes. As a result, less data is streamed over the network, and less CPU is consumed, especially for data models that contain small cells.
How Discord Indexes Trillions of Messages: Scaling Search Infrastructure by V...ScyllaDB
?
This talk shares how Discord scaled their message search infrastructure using Rust, Kubernetes, and a multi-cluster Elasticsearch architecture to achieve better performance, operability, and reliability, while also enabling new search features for Discord users.
Inside Freshworks' Migration from Cassandra to ScyllaDB by Premkumar PatturajScyllaDB
?
Freshworks migrated from Cassandra to ScyllaDB to handle growing audit log data efficiently. Cassandra required frequent scaling, complex repairs, and had non-linear scaling. ScyllaDB reduced costs with fewer machines and improved operations. Using Zero Downtime Migration (ZDM), they bulk-migrated data, performed dual writes, and validated consistency.
Data Structures Handling Trillions of Daily Streaming Events by Evan ChanScyllaDB
?
Conviva built a high-performance streaming analytics pipeline processing trillions of events daily¡ªthen made it programmable for diverse workloads. This talk covers how they boosted throughput & memory efficiency 5x+ with Rust, zero-copy data, better execution models & smarter data structures.
Building a Scalable Event-Driven Architecture for Open Finance Brasil by Thi...ScyllaDB
?
Learn about PCM's journey to an event-driven, serverless architecture that empowers Open Finance Brasil to process billion daily reports with cost efficiency using AWS SQS, DynamoDB, S3, Firehose, ElasticCache, EKS with KEDA and Karpenter, Node.js, and Nest.js.
How Discord Performs Database Upgrades at Scale by Ethan DonowitzScyllaDB
?
Discord relies on ScyllaDB to serve millions of reads per second across many clusters, so they needed a comprehensive strategy to sufficiently de-risk upgrades to avoid impact to our users. To accomplish this, they use what they call ¡°shadow clusters.¡± This talk explains how testing with shadow clusters has been paramount to de-risking complicated upgrades for one of the most important pieces of infrastructure at Discord.
Telemetry Showdown: Fluent Bit vs. OpenTelemetry Collector by Henrik RexedScyllaDB
?
Fluent Bit or OpenTelemetry Collector¡ªwhich performs better? With OpenTelemetry unifying observability, Fluent Bit has expanded beyond logs. This session explores key differences, benchmark results, and insights to help you choose the best agent for your cloud-native stack.
Pushing Your Streaming Platform to the Limit by Elad LeevScyllaDB
?
Join Elad for a hands-on session on Chaos Engineering for streaming platforms like Kafka, Pulsar, NATS, and RabbitMQ. Learn to stress test, benchmark, and fine-tune performance to ensure your system stays resilient under pressure.
Overcome Redis Cluster Scale Bottlenecks with ScyllaDB & EloqKV by Hubert ZhangScyllaDB
?
Introducing EloqKV¡ªa new project built on ScyllaDB! It delivers sub-millisecond read/write performance, full transactions, and a Redis-compatible API, acting as a fast, consistent cache. Scale beyond Redis Cluster limits with ScyllaDB¡¯s massive scalability and resilience.
Caching for Performance Masterclass: Caching at ScaleScyllaDB
?
Weighing caching considerations for use cases with different technical requirements and growth expectations.
- Request coalescing
- Negative sharding
- Rate limiting
- Sharding and scaling
Formal Methods: Whence and Whither? [Martin Fr?nzle Festkolloquium, 2025]Jonathan Bowen
?
Alan Turing arguably wrote the first paper on formal methods 75 years ago. Since then, there have been claims and counterclaims about formal methods. Tool development has been slow but aided by Moore¡¯s Law with the increasing power of computers. Although formal methods are not widespread in practical usage at a heavyweight level, their influence as crept into software engineering practice to the extent that they are no longer necessarily called formal methods in their use. In addition, in areas where safety and security are important, with the increasing use of computers in such applications, formal methods are a viable way to improve the reliability of such software-based systems. Their use in hardware where a mistake can be very costly is also important. This talk explores the journey of formal methods to the present day and speculates on future directions.
? ????? ??????? ????? ?
???????? ??????????? is proud to be a part of the ?????? ????? ???? ???? ??????? (?????) success story! By delivering seamless, secure, and high-speed connectivity, OSWAN has revolutionized e-?????????? ?? ??????, enabling efficient communication between government departments and enhancing citizen services.
Through our innovative solutions, ???????? ?????????? has contributed to making governance smarter, faster, and more transparent. This milestone reflects our commitment to driving digital transformation and empowering communities.
? ?????????? ??????, ?????????? ??????????!
Transform Your Future with Front-End Development TrainingVtechlabs
?
Kickstart your career in web development with our front-end web development course in Vadodara. Learn HTML, CSS, JavaScript, React, and more through hands-on projects and expert mentorship. Our front-end development course with placement includes real-world training, mock interviews, and job assistance to help you secure top roles like Front-End Developer, UI/UX Developer, and Web Designer.
Join VtechLabs today and build a successful career in the booming IT industry!
Unlock AI Creativity: Image Generation with DALL¡¤EExpeed Software
?
Discover the power of AI image generation with DALL¡¤E, an advanced AI model that transforms text prompts into stunning, high-quality visuals. This presentation explores how artificial intelligence is revolutionizing digital creativity, from graphic design to content creation and marketing. Learn about the technology behind DALL¡¤E, its real-world applications, and how businesses can leverage AI-generated art for innovation. Whether you're a designer, developer, or marketer, this guide will help you unlock new creative possibilities with AI-driven image synthesis.
World Information Architecture Day 2025 - UX at a CrossroadsJoshua Randall
?
User Experience stands at a crossroads: will we live up to our potential to design a better world? or will we be co-opted by ¡°product management¡± or another business buzzword?
Looking backwards, this talk will show how UX has repeatedly failed to create a better world, drawing on industry data from Nielsen Norman Group, Baymard, MeasuringU, WebAIM, and others.
Looking forwards, this talk will argue that UX must resist hype, say no more often and collaborate less often (you read that right), and become a true profession ¡ª in order to be able to design a better world.
THE BIG TEN BIOPHARMACEUTICAL MNCs: GLOBAL CAPABILITY CENTERS IN INDIASrivaanchi Nathan
?
This business intelligence report, "The Big Ten Biopharmaceutical MNCs: Global Capability Centers in India", provides an in-depth analysis of the operations and contributions of the Global Capability Centers (GCCs) of ten leading biopharmaceutical multinational corporations in India. The report covers AstraZeneca, Bayer, Bristol Myers Squibb, GlaxoSmithKline (GSK), Novartis, Sanofi, Roche, Pfizer, Novo Nordisk, and Eli Lilly. In this report each company's GCC is profiled with details on location, workforce size, investment, and the strategic roles these centers play in global business operations, research and development, and information technology and digital innovation.
A Framework for Model-Driven Digital Twin EngineeringDaniel Lehner
?
ºÝºÝߣs from my PhD Defense at Johannes Kepler University, held on Janurary 10, 2025.
The full thesis is available here: https://epub.jku.at/urn/urn:nbn:at:at-ubl:1-83896
UiPath Automation Developer Associate Training Series 2025 - Session 1DianaGray10
?
Welcome to UiPath Automation Developer Associate Training Series 2025 - Session 1.
In this session, we will cover the following topics:
Introduction to RPA & UiPath Studio
Overview of RPA and its applications
Introduction to UiPath Studio
Variables & Data Types
Control Flows
You are requested to finish the following self-paced training for this session:
Variables, Constants and Arguments in Studio 2 modules - 1h 30m - https://academy.uipath.com/courses/variables-constants-and-arguments-in-studio
Control Flow in Studio 2 modules - 2h 15m - https:/academy.uipath.com/courses/control-flow-in-studio
?? For any questions you may have, please use the dedicated Forum thread. You can tag the hosts and mentors directly and they will reply as soon as possible.
Computational Photography: How Technology is Changing Way We Capture the WorldHusseinMalikMammadli
?
? Computational Photography (Computer Vision/Image): How Technology is Changing the Way We Capture the World
He? d¨¹?¨¹nm¨¹s¨¹n¨¹zm¨¹, m¨¹asir smartfonlar v? kameralar nec? bu q?d?r g?z?l g?r¨¹nt¨¹l?r yarad?r? Bunun sirri Computational Fotoqrafiyas?nda(Computer Vision/Imaging) gizlidir¡ª??kill?ri ??km? v? emal etm? ¨¹sulumuzu t?kmill??dir?n, komp¨¹ter elmi il? fotoqrafiyan?n inqilabi birl??m?si.
DealBook of Ukraine: 2025 edition | AVentures CapitalYevgen Sysoyev
?
The DealBook is our annual overview of the Ukrainian tech investment industry. This edition comprehensively covers the full year 2024 and the first deals of 2025.
Field Device Management Market Report 2030 - TechSci ResearchVipin Mishra
?
The Global Field Device Management (FDM) Market is expected to experience significant growth in the forecast period from 2026 to 2030, driven by the integration of advanced technologies aimed at improving industrial operations.
? According to TechSci Research, the Global Field Device Management Market was valued at USD 1,506.34 million in 2023 and is anticipated to grow at a CAGR of 6.72% through 2030. FDM plays a vital role in the centralized oversight and optimization of industrial field devices, including sensors, actuators, and controllers.
Key tasks managed under FDM include:
Configuration
Monitoring
Diagnostics
Maintenance
Performance optimization
FDM solutions offer a comprehensive platform for real-time data collection, analysis, and decision-making, enabling:
Proactive maintenance
Predictive analytics
Remote monitoring
By streamlining operations and ensuring compliance, FDM enhances operational efficiency, reduces downtime, and improves asset reliability, ultimately leading to greater performance in industrial processes. FDM¡¯s emphasis on predictive maintenance is particularly important in ensuring the long-term sustainability and success of industrial operations.
For more information, explore the full report: https://shorturl.at/EJnzR
Major companies operating in Global?Field Device Management Market are:
General Electric Co
Siemens AG
ABB Ltd
Emerson Electric Co
Aveva Group Ltd
Schneider Electric SE
STMicroelectronics Inc
Techno Systems Inc
Semiconductor Components Industries LLC
International Business Machines Corporation (IBM)
#FieldDeviceManagement #IndustrialAutomation #PredictiveMaintenance #TechInnovation #IndustrialEfficiency #RemoteMonitoring #TechAdvancements #MarketGrowth #OperationalExcellence #SensorsAndActuators
Future-Proof Your Career with AI OptionsDianaGray10
?
Learn about the difference between automation, AI and agentic and ways you can harness these to further your career. In this session you will learn:
Introduction to automation, AI, agentic
Trends in the marketplace
Take advantage of UiPath training and certification
In demand skills needed to strategically position yourself to stay ahead
? If you have any questions or feedback, please refer to the "Women in Automation 2025" dedicated Forum thread. You can find there extra details and updates.
https://ncracked.com/7961-2/
Note: >> Please copy the link and paste it into Google New Tab now Download link
Free Download Wondershare Filmora 14.3.2.11147 Full Version - All-in-one home video editor to make a great video.Free Download Wondershare Filmora for Windows PC is an all-in-one home video editor with powerful functionality and a fully stacked feature set. Filmora has a simple drag-and-drop top interface, allowing you to be artistic with the story you want to create.Video Editing Simplified - Ignite Your Story. A powerful and intuitive video editing experience. Filmora 10 hash two new ways to edit: Action Cam Tool (Correct lens distortion, Clean up your audio, New speed controls) and Instant Cutter (Trim or merge clips quickly, Instant export).Filmora allows you to create projects in 4:3 or 16:9, so you can crop the videos or resize them to fit the size you want. This way, quickly converting a widescreen material to SD format is possible.
Backstage Software Templates for Java DevelopersMarkus Eisele
?
As a Java developer you might have a hard time accepting the limitations that you feel being introduced into your development cycles. Let's look at the positives and learn everything important to know to turn Backstag's software templates into a helpful tool you can use to elevate the platform experience for all developers.
5. Kafka A storage backend optimized for storing
ordered ¡°event¡± data
6. Kafka
Kafka Streams
A storage backend optimized for storing
ordered ¡°event¡± data
A library for building event-driven
applications: realtime, responsive & stateful
7. Kafka Streams is Everywhere
Realtime Inference
Walmart uses Kafka Streams
to power fraud detection and
purchase recommendations
8. Kafka Streams is Everywhere
Realtime Inference
Walmart uses Kafka Streams
to power fraud detection and
purchase recommendations
Logistics
Michelin uses Kafka Streams to
handle their tier distribution, ensuring
delivery is tracked in realtime
9. Kafka Streams is Everywhere
Realtime Inference
Walmart uses Kafka Streams
to power fraud detection and
purchase recommendations
Logistics
Michelin uses Kafka Streams to
handle their tier distribution, ensuring
delivery is tracked in realtime
Liquidity Management
Michelin uses Kafka Streams to
handle their tier distribution, ensuring
delivery is tracked in realtime
12. Original Design Goals
Just a Library
Kafka Streams should be easy
to integrate with existing apps.
Complete API
All stream processing use cases
should be possible to write using
Kafka Streams.
13. Original Design Goals
Just a Library
Kafka Streams should be easy
to integrate with existing apps.
Complete API
All stream processing use cases
should be possible to write using
Kafka Streams.
Depend only on Kafka
There should be no dependencies on
external systems (such as HDFS or
YARN).
14. Original Design Goals
Depend only on Kafka
There should be no dependencies on
external systems (such as HDFS or
YARN).
16. Original Design Goals
Availability
Source of truth is in a ¡°changelog¡± topic in Kafka. If
assignment changes, state must be restored.
Flexibility
Dynamic scaling is impractical. Most deployments
are provisioned for peak throughput.
17. Original Design Goals
Availability
Source of truth is in a ¡°changelog¡± topic in Kafka. If
assignment changes, state must be restored.
Flexibility
Dynamic scaling is impractical. Most deployments
are provisioned for peak throughput.
Performance
Di?cult to properly attribute resources to the
stream processing vs. RocksDB storage subsystem
20. What Changed?
2016
Kafka Streams is released with
Apache Kafka 0.10
2017
Cloud databases gain
mainstream adoption
2019
Kubernetes wins out as de
facto orchestration system
21. What Changed?
2016
Kafka Streams is released with
Apache Kafka 0.10
2017
Cloud databases gain
mainstream adoption
2019
Kubernetes wins out as de
facto orchestration system
2020
Kafka Streams popularity soars,
widening possible use cases
22. What Changed?
2016
Kafka Streams is released with
Apache Kafka 0.10
2017
Cloud databases gain
mainstream adoption
2019
Kubernetes wins out as de
facto orchestration system
2020
Kafka Streams popularity soars,
widening possible use cases
Today
Some assumptions guiding the original
Kafka Streams design are outdated
26. Deep Dive: Metronome
Metronome powers billing for companies like
OpenAI, NVIDIA and Databricks using Kafka
Streams.
Mission Critical Feature: Realtime Spend Limits
27. Key Results Migrating
from RocksDB to Scylla
Availability
Going from regular incidents to ¡°not thinking about
Kafka Streams¡±
Throughput Growth
ScyllaDB scaled without hiccup as their data size
and throughput scaled
Scale Potential
Decoupled compute and storage means we¡¯ve been
able to scale number of Kafka Partitions and
ScyllaDB cluster size independently
99.99%
3x
¡Þ
29. Data Model
CREATE TABLE key_value
partitionKey INTEGER,
dataKey BLOB,
dataValue BLOB,
epoch BIGINT,
offset BIGINT,
PRIMARY KEY ((partitionKey),
dataKey);
Raw Data
Since Kafka Streams deals only with serialized
bytes (users supply the serializers) it makes storing
data easy and general purpose
30. Data Model
CREATE TABLE key_value
partitionKey INTEGER,
dataKey BLOB,
dataValue BLOB,
epoch BIGINT,
offset BIGINT,
PRIMARY KEY ((partitionKey),
dataKey);
Primary Key
Using the Kafka partition allows us to implement
LWTs per partition and the data key allows us to
implement e?cient lookups and range scans
31. Data Model
CREATE TABLE key_value
partitionKey INTEGER,
dataKey BLOB,
dataValue BLOB,
epoch BIGINT,
offset BIGINT,
PRIMARY KEY ((partitionKey),
dataKey);
Restoring State
Storing the offset with a sentinel dataKey enables
e?cient client hand-offs in failure scenarios
35. Node B
Node A process Kafka commit
write to
ScyllaDB
Fencing Zombies
36. Node B
Node A process Kafka commit
write to
ScyllaDB
process Kafka commit
write to
ScyllaDB
Fencing Zombies
37. Node B
Node A process Kafka commit
write to
ScyllaDB
process Kafka commit
write to
ScyllaDB
prevents other nodes from
committing to Kafka (but not
writing to Scylla!)
Fencing Zombies
38. Node B
Node A process Kafka commit
write to
ScyllaDB
process Kafka commit
write to
ScyllaDB
write to
ScyllaDB
Fencing Zombies
39. Node B
Node A process Kafka commit
write to
ScyllaDB
process Kafka commit
write to
ScyllaDB
write to
ScyllaDB
Might overwrite data from the
previous ScyllaDB write!
Fencing Zombies
40. Data Model
CREATE TABLE key_value
partitionKey INTEGER,
dataKey BLOB,
dataValue BLOB,
epoch BIGINT,
offset BIGINT,
PRIMARY KEY ((partitionKey),
dataKey);
BEGIN BATCH;
starts an Atomic Batch (not
for speed, but for LWT)
41. Data Model
CREATE TABLE key_value
partitionKey INTEGER,
dataKey BLOB,
dataValue BLOB,
epoch BIGINT,
offset BIGINT,
PRIMARY KEY ((partitionKey),
dataKey);
BEGIN BATCH;
UPDATE key_value
SET epoch = 12
WHERE
partitionKey = 1
dataKey = metadata_key
IF epoch <= 12;
42. Data Model
CREATE TABLE key_value
partitionKey INTEGER,
dataKey BLOB,
dataValue BLOB,
epoch BIGINT,
offset BIGINT,
PRIMARY KEY ((partitionKey),
dataKey);
BEGIN BATCH;
UPDATE key_value
SET epoch = 12
WHERE
partitionKey = 1
dataKey = metadata_key
IF epoch <= 12;
INSERT INTO key_value VALUES ¡;
INSERT INTO key_value VALUES ¡;
INSERT INTO key_value VALUES ¡;
INSERT INTO key_value VALUES ¡;
43. Data Model
CREATE TABLE key_value
partitionKey INTEGER,
dataKey BLOB,
dataValue BLOB,
epoch BIGINT,
offset BIGINT,
PRIMARY KEY ((partitionKey),
dataKey);
BEGIN BATCH;
UPDATE key_value
SET epoch = 12
WHERE
partitionKey = 1
dataKey = metadata_key
IF epoch <= 12;
INSERT INTO key_value VALUES ¡;
INSERT INTO key_value VALUES ¡;
INSERT INTO key_value VALUES ¡;
INSERT INTO key_value VALUES ¡;
APPLY BATCH;
44. Data Model
CREATE TABLE key_value
partitionKey INTEGER,
dataKey BLOB,
dataValue BLOB,
epoch BIGINT,
offset BIGINT,
PRIMARY KEY ((partitionKey),
dataKey);
BEGIN BATCH;
UPDATE key_value
SET epoch = 12
WHERE
partitionKey = 1
dataKey = metadata_key
IF epoch <= 12;
INSERT INTO key_value VALUES ¡;
INSERT INTO key_value VALUES ¡;
INSERT INTO key_value VALUES ¡;
INSERT INTO key_value VALUES ¡;
APPLY BATCH;
46. Latency Issues? Check Disk!
Bloom Filter Usage
An increase in disk reads is often a symptom
something else is wrong. For us, there were a few
times Bloom Filters were not properly constructed
(once due to a con?g, once due to a bug)!
47. Use LWT Only When Necessary
Cost of Atomic Batches
Atomic Batches signi?cantly slow down write
throughput, even if they¡¯re contained to only a
single partition
Throughput Before/After Removing LWTs
49. Consistency
Don¡¯t Be Inconsistent!
Make sure you¡¯ve set your Read / Write consistency
levels. The default is ONE / ONE, which can give
inconsistent results!
Favor Fast Reads
Favor Fast Writes
Inconsistent Consistent
ONE/ONE
50. Consistency
Don¡¯t Be Inconsistent!
Make sure you¡¯ve set your Read / Write consistency
levels. The default is ONE / ONE, which can give
inconsistent results!
Favor Fast Reads
Favor Fast Writes
Inconsistent Consistent
QUORUM/
QUORUM
ONE/ONE
51. Consistency
Don¡¯t Be Inconsistent!
Make sure you¡¯ve set your Read / Write consistency
levels. The default is ONE / ONE, which can give
inconsistent results!
Favor Fast Reads
Favor Fast Writes
Inconsistent Consistent
QUORUM/
QUORUM
ONE/ONE
ONE/ALL
risky, but has niche
use cases
53. Consistency
QUORUM/QUORUM
Need Faster Reads
Option: use ONE/ALL temporarily
Migrate to ALL / ALL
This is an intermediate state necessary
to maintain consistency
Run Repair
Ensure all data is available on all nodes
from the QUORUM / QUORUM time
54. Consistency
QUORUM/QUORUM
Need Faster Reads
Option: use ONE/ALL temporarily
Migrate to ALL / ALL
This is an intermediate state necessary
to maintain consistency
Run Repair
Ensure all data is available on all nodes
from the QUORUM / QUORUM time
Enable ONE / ALL
Speedy reads! This can help you
temporarily and migrating back to
QUORUM / QUORUM is safe.
55. Stay in Touch
Almog Gavra
almog@responsive.dev
@almog.ai
@agavra
/in/agavra/