Introducing the Support of Lateral Column Alias

We are thrilled to introduce the support of a new SQL feature in Apache Spark and Databricks: Lateral Column Alias (LCA). This feature simplifies complex SQL queries by allowing users to reuse an expression specified earlier in the same SELECT list, eliminating the need to use nested subqueries and Common Table Expressions (CTEs) in many […]
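As a rough illustration of the idea in this excerpt, the sketch below contrasts a query written with a lateral column alias against the CTE form it would replace. The `orders` table and its columns are hypothetical, chosen only for this example. Only the CTE form is executed, using SQLite from Python's standard library, because SQLite does not support lateral column aliases; the LCA form is shown as a string and assumes an engine with LCA support.

```python
import sqlite3

# Hypothetical table and column names, used only for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (price INTEGER, qty INTEGER)")
conn.executemany("INSERT INTO orders VALUES (?, ?)", [(10, 2), (4, 3)])

# With a lateral column alias, `total` is reused later in the same SELECT list.
# Not executed here: SQLite does not support this form.
LCA_QUERY = """
SELECT price * qty AS total,
       total * 2   AS doubled
FROM orders
ORDER BY total
"""

# Without LCA, the alias has to be defined in a CTE (or subquery) first.
CTE_QUERY = """
WITH t AS (SELECT price * qty AS total FROM orders)
SELECT total, total * 2 AS doubled
FROM t
ORDER BY total
"""

rows = conn.execute(CTE_QUERY).fetchall()
print(rows)
```

The two queries are meant to express the same computation; the LCA form simply avoids the extra nesting level.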

Introducing Apache Spark™ 3.5 | Databricks Blog

Today, we are happy to announce the availability of Apache Spark™ 3.5 on Databricks as part of Databricks Runtime 14.0. We extend our sincere appreciation to the Apache Spark community for their invaluable contributions to the Spark 3.5 release. Aligned with our mission to make Spark more accessible, versatile, and efficient than ever before, this […]

Best Practices for LLM Evaluation of RAG Applications

Chatbots are the most widely adopted use case for leveraging the powerful chat and reasoning capabilities of large language models (LLMs). The retrieval augmented generation (RAG) architecture is quickly becoming the industry standard for developing chatbots because it combines the benefits of a knowledge base (via a vector store) and generative models (e.g., GPT-3.5 and […]

Introducing MLflow 2.7 with new LLMOps capabilities

As part of MLflow 2’s support for LLMOps, we are excited to introduce the latest updates to support prompt engineering in MLflow 2.7. Assess LLM project viability with an interactive prompt interface: Prompt engineering is a great way to quickly assess if a use case can be solved with a large language model (LLM). With the […]

Q&A With Dr. Anna Zeiter, Chief Privacy Officer, Associate General Counsel for Privacy, Data & AI

Artificial Intelligence (AI) has evolved and improved dramatically over the past 18 months, and the technology has myriad applications and amazing potential for the future of ecommerce. But with such lightning-quick advancements, it’s vital that we in the technology industry, especially companies like eBay with the size, scope, and personnel to guide the future of […]

‘Magical’ Listing Tool Harnesses the Power of AI to Make Selling on eBay Faster, Easier, and More Accurate

We’ve been hard at work on the next version of a new, magical listing experience, which uses AI to analyze, research, and extrapolate information from a small amount of data provided by the seller — greatly simplifying the process of listing an item on eBay. In the first version, a seller could add in a […]

Shared Clusters in Unity Catalog for the win: Introducing Cluster Libraries, Python UDFs, Scala, Machine Learning and more

We are thrilled to announce that you can run even more workloads on Databricks’ highly efficient multi-user clusters thanks to new security and governance features in Unity Catalog. Data teams can now develop and run SQL, Python and Scala workloads securely on shared compute resources. With that, Databricks is the only platform in the industry […]

What’s New for Shared Clusters in Unity Catalog

We are thrilled to announce great enhancements to onboard more workloads to Unity Catalog clusters in shared access mode, Databricks’ highly efficient, secure multi-user clusters. Data teams can now develop and run SQL, Python and Scala workloads securely on shared compute resources. With that, Databricks is the only platform in the industry offering fine-grained access […]

Efficient Fine-Tuning with LoRA: A Guide to Optimal Parameter Selection for Large Language Models

With the rapid advancement of neural network-based techniques and Large Language Model (LLM) research, businesses are increasingly interested in AI applications for value generation. They employ various machine learning approaches, both generative and non-generative, to address text-related challenges such as classification, summarization, sequence-to-sequence tasks, and controlled text generation. Organizations can opt for third-party APIs, but […]

Building hyperlocal GrabMaps

Introduction: Southeast Asia (SEA) is a dynamic market, very different from other parts of the world. When travelling on the road, you may experience fast-changing road restrictions, new roads appearing overnight, and high traffic congestion. To address these challenges, GrabMaps has adapted to the SEA market by leveraging big data solutions. One of the solutions […]
