Efficient Fine-Tuning with LoRA: A Guide to Optimal Parameter Selection for Large Language Models

With the rapid advancement of neural network-based techniques and Large Language Model (LLM) research, businesses are increasingly interested in AI applications for value generation. They employ various machine learning approaches, both generative and non-generative, to address text-related challenges such as classification, summarization, sequence-to-sequence tasks, and controlled text generation. Organizations can opt for third-party APIs, but […]

Continue Reading

Building hyperlocal GrabMaps

Introduction Southeast Asia (SEA) is a dynamic market, very different from other parts of the world. When travelling on the road, you may experience fast-changing road restrictions, new roads appearing overnight, and high traffic congestion. To address these challenges, GrabMaps has adapted to the SEA market by leveraging big data solutions. One of the solutions […]

Continue Reading

Using MLflow AI Gateway and Llama 2 to Build Generative AI Apps

To build customer support bots, internal knowledge graphs, or Q&A systems, customers often use Retrieval Augmented Generation (RAG) applications which leverage pre-trained models together with their proprietary data. However, the lack of guardrails for secure credential management and abuse prevention prohibits customers from democratizing access and development of these applications. We recently announced the MLflow […]

Continue Reading

Streamlining Grab’s Segmentation Platform with faster creation and lower latency

Launched in 2019, Segmentation Platform has been Grab’s one-stop platform for user segmentation and audience creation across all business verticals. User segmentation is the process of dividing passengers, driver-partners, or merchant-partners (users) into sub-groups (segments) based on certain attributes. Segmentation Platform empowers Grab’s teams to create segments using attributes available within our data ecosystem and […]

Continue Reading

Delta UniForm: a universal format for lakehouse interoperability

One of the key challenges that organizations face when adopting the open data lakehouse is selecting the optimal format for their data. Among the available options, Linux Foundation Delta Lake, Apache Iceberg, and Apache Hudi are all excellent storage formats that enable data democratization and interoperability. Any of these formats is better than putting your […]

Continue Reading

Multiple Stateful Operators in Structured Streaming

In the world of data engineering, there are operations that have been used since the birth of ETL. You filter. You join. You aggregate. Finally, you write the result. While these data operations have remained the same over time, the range of latency and throughput requirements has changed dramatically. Processing a few events at a […]

Continue Reading

Smooth Sailing Ahead | Databricks Blog

The Databricks Container Infra team builds cloud-agnostic infrastructure and tooling for building, storing and distributing container images. Recently, the team worked on scaling Harbor, an open-source container registry. Request loads on Harbor are read-heavy and bursty and it is a critical component of Databricks’ serverless product – anytime new serverless VMs are provisioned, Harbor gets […]

Continue Reading

Unsupervised graph anomaly detection – Catching new fraudulent behaviours

Earlier in this series, we covered the importance of graph networks, graph concepts, graph visualisation, and graph-based fraud detection methods. In this article, we will discuss how to automatically detect new types of fraudulent behaviour and swiftly take action on them. One of the challenges in fraud detection is that fraudsters are incentivised to always […]

Continue Reading

Announcing the MLflow AI Gateway

Large Language Models (LLMs) unlock a wide spectrum of potential use cases to deliver business value, from analyzing the sentiment of text data stored in a SQL warehouse to deploying real-time chat bots that answer nuanced questions about your products. However, democratizing access to powerful SaaS and open source LLMs for these applications comes with […]

Continue Reading

eBay Execs Talk Generative AI and Computer Vision at VentureBeat Transform Conference

Chief AI Officer Nitzan Mekel-Bobrov and Vice President of Seller Experience Xiaodi Zhang appeared at the VentureBeat Transform conference on Tuesday, July 11th, to discuss generative AI, how eBay has been building AI infrastructure for many years and ways that recent evolutions in the technology can help sellers, buyers and employees. Xiaodi also spoke as […]

Continue Reading