Databricks announces significant improvements to the built-in LLM judges in Agent Evaluation

An improved answer-correctness judge in Agent Evaluation: Agent Evaluation enables Databricks customers to define, measure, and understand how to improve the quality of agentic GenAI applications. Measuring the quality of ML outputs takes on a new dimension of complexity for GenAI applications, especially in industry-specific contexts dealing with customer data: the inputs may comprise complex open-ended […]

Revolutionizing Insight into Heavy Equipment Maintenance with GenAI

Maintaining heavy equipment assets, such as oil rigs, agricultural combines, or fleets of vehicles, poses an extremely complex challenge for global companies. These assets are often spread across the globe, while their maintenance schedules and lifecycles are typically determined at a company-wide level. The failure of a key component can result in millions of dollars […]

Training Highly Scalable Deep Recommender Systems on Databricks (Part 1)

Recommender systems (RecSys) have become an integral part of modern digital experiences, powering personalized content suggestions across various platforms. These sophisticated systems and algorithms analyze user behavior, preferences, and item characteristics to predict and recommend items of interest. In the era of big data and machine learning, recommender systems have evolved from simple collaborative filtering […]

A scalable experimentation and development platform for Notebook services

The ability to iterate rapidly is key to innovation and improvement in machine learning (ML) models. Our team, Chimera, part of the Artificial Intelligence (AI) Platform team, provides the essential compute infrastructure, ML pipeline components, and backend services. This support enables our ML engineers, data scientists, and data analysts to efficiently experiment and develop ML […]

Process streaming in DLT Framework

All the code is available in this GitHub repository. Introduction: Synchronizing data from external relational databases like Oracle, MySQL, or a data warehouse into the Databricks Data Intelligence Platform is a common use case. Databricks offers multiple approaches ranging from LakeFlow Connect’s simple and efficient ingestion connectors to Delta Live Tables’ (DLT) flexibility with APPLY […]

Announcing Hybrid Search General Availability in Mosaic AI Vector Search

We’re excited to announce the general availability of hybrid search in Mosaic AI Vector Search. Hybrid search is a powerful feature that combines the strengths of pre-trained embedding models with the flexibility of keyword search. In this blog post, we’ll explain why hybrid search is important, how it works, and how you can use it […]

Unlock Faster Machine Learning with Graviton

We are excited to announce that Graviton, the ARM-based CPU instance offered by AWS, is now supported on the Databricks ML Runtime cluster. There are several ways that Graviton instances provide value for machine learning workloads: Speedups for various machine learning libraries: ML libraries like XGBoost, LightGBM, Spark MLlib, and Databricks Feature Engineering could see up to 30-50% […]

eBay Introduces Intuitive Search Redesign to Elevate Shopper Experience

eBay has transformed its search experience with a major redesign aimed at providing a seamless, intuitive, and visually rich shopping journey. After 18 months of extensive user research and testing, eBay’s new search interface features larger, high-resolution images, a modernized layout, and streamlined navigation. These enhancements empower buyers to make informed decisions quickly and efficiently, […]

Harnessing the Power of Databricks Mosaic AI for Image Generation at Rolls-Royce

Rolls-Royce has witnessed the transformative power of the Databricks Data Intelligence Platform in various AI projects. One example is a collaboration between Rolls-Royce and Databricks, focused on optimizing Conditional Generative Adversarial Network (cGAN) training processes, which demonstrates the numerous benefits of using Databricks Mosaic AI tools. For this joint cGAN training optimization project, the team […]

How we improved translation experience with cost efficiency

Introduction: As COVID restrictions were fully lifted in 2023, the number of tourists grew dramatically. People began to explore the world again, frequently using the Grab app to make bookings outside of their home country. However, we noticed that communication posed a challenge for some users. Despite our efforts to integrate an auto-translation feature in […]
