eBay’s Common Automation Solution for Platform Evolution

[ad_1] For any large online business, the platform is a foundational piece. eBay’s platform contains software frameworks and infrastructure in its backend. Because the platform is so important, updates are essential to keeping the applications — including fundamental operations like search and checkout — stable and reliable. At eBay, there are more than 3,000 site […]

Continue Reading

Performance bottlenecks of Go application on Kubernetes with non-integer (floating) CPU allocation

[ad_1] Grab’s real-time data platform team, Coban, has been running its stream processing framework on Kubernetes, as detailed in Plumbing at scale. We’ve also written another article (Scaling Kafka consumers) about vertical pod autoscaling (VPA) and the benefits of using it. In this article, we cover the performance bottlenecks and other issues we came across […]

Continue Reading

How we improved our iOS CI infrastructure with observability tools

[ad_1] Note: Timestamps used in this article are in UTC+8 Singapore time, unless stated otherwise. Background When we upgraded to Xcode 13.1 in April 2022, we noticed a few issues such as instability of the CI tests and other problems related to the switch to Xcode 13.1.  After taking a step back, we investigated this […]

Continue Reading

Understanding Caching in Databricks SQL: UI, Result, and Disk Caches

[ad_1] Caching is an essential technique for improving the performance of data warehouse systems by avoiding the need to recompute or fetch the same data multiple times. In Databricks SQL, caching can significantly speed up query execution and minimize warehouse usage, resulting in lower costs and more efficient resource utilization. This article will explore the […]

Continue Reading

Announcing the General Availability of Databricks SQL Serverless !

[ad_1] Today, we are thrilled to announce that serverless compute for Databricks SQL is Generally Available on AWS and Azure! Databricks SQL (DB SQL) Serverless provides the best performance with instant and elastic compute, lowers costs, and enables you to focus on delivering the most value to your business rather than managing infrastructure. With GA, […]

Continue Reading

Latency goes subsecond in Apache Spark Structured Streaming

[ad_1] Apache Spark Structured Streaming is the leading open source stream processing platform. It is also the core technology that powers streaming on the Databricks Lakehouse Platform and provides a unified API for batch and stream processing. As the adoption of streaming is growing rapidly, diverse applications want to take advantage of it for real […]

Continue Reading

2.3x faster using the Go plugin to replace Lua virtual machine

[ad_1] Abstract We’re excited to share with you the latest update on our open-source project Talaria. In our efforts to improve performance and overcome infrastructure limitations, we’ve made significant strides by implementing the Go plugin to replace Lua VM. Our team has found that the Go plugin is roughly 2.3x faster and uses 2.3x less […]

Continue Reading

Building Data Applications on the Lakehouse With the Databricks SQL Driver for Node.js

[ad_1] We are excited to announce the general availability of the Databricks SQL Driver for NodeJS. This follows the recent general availability of Databricks SQL Driver for GO and the earlier Databricks SQL Connector for Python. Node.js developers can now easily build data applications on the lakehouse in pure Javascript or TypeScript. The NodeJS driver […]

Continue Reading

Announcing the Public Preview of Predictive I/O for Updates

[ad_1] Previously, we’ve shown you how a new technology called Predictive I/O could improve selective reads by up to 35x for CDW customers without any knobs. Today, we are excited to announce the public preview of another innovative leap, Predictive I/O for Updates, providing you with up to 10x faster MERGE, UPDATE, and DELETE query […]

Continue Reading

Announcing the General Availability of Predictive I/O for Reads

[ad_1] Today, we are excited to announce the general availability of Predictive I/O for Databricks SQL (DB SQL): a machine learning powered feature to make your point lookups faster and cheaper. Predictive I/O leverages the years of experience Databricks has in building large AI/ML systems to make the Lakehouse the smartest data warehouse with no […]

Continue Reading