Engineering Archives - Page 23 of 30

How eBay Made Its New Accessibility Tool — And Made It Available to All

Posted by admin on March 14, 2023

There is sometimes a fundamental gap between the engineering and design teams when creating a new product. Designers want their work to be accessible, but many of the available tools are cumbersome, confusing, and come with processes that aren’t well-defined. This can lead to designers delivering their work to engineers without fully baked accessibility, which […]

Unsupervised Outlier Detection on Databricks

Posted by admin on March 14, 2023

Kakapo (KAH-kə-poh) implements a standard set of APIs for outlier detection at scale on Databricks. It integrates the PyOD library of outlier detection algorithms with MLflow for tracking and packaging models, and with Hyperopt for exploring large, complex, and heterogeneous search spaces. The views expressed in this article are privately […]

Migrating from Role to Attribute-based Access Control

Posted by admin on March 9, 2023

Grab has always regarded security as one of our top priorities; this is especially important for data platform teams. We need to control access to data and resources in order to protect our consumers and ensure compliance with various, continuously evolving security standards. Additionally, we want to keep the process convenient, simple, and easily scalable […]

Scalable Spark Structured Streaming for REST API Destinations

Posted by admin on March 2, 2023

Spark Structured Streaming is the widely used open source engine at the foundation of data streaming on the Databricks Lakehouse Platform. It can elegantly handle diverse logical processing at volumes ranging from small-scale ETL to the largest Internet services. This power has led to adoption in many use cases across industries. Another strength of Structured Streaming […]

Securing GitOps pipelines

Posted by admin on March 1, 2023

Grab’s real-time data platform team, Coban, has been managing infrastructure resources via Infrastructure-as-code (IaC). Through the IaC approach, Terraform is used to maintain infrastructure consistency, automation, and ease of deployment of our streaming infrastructure, notably: With Grab’s exponential growth, there needs to be a better way to scale infrastructure automatically. Moving towards GitOps processes […]

Announcing Ray support on Databricks and Apache Spark Clusters

Posted by admin on February 28, 2023

Ray is a prominent compute framework for running scalable AI and Python workloads, offering a variety of distributed machine learning tools, large-scale hyperparameter tuning capabilities, reinforcement learning algorithms, model serving, and more. Similarly, Apache Spark™ provides a wide variety of high-performance algorithms for distributed machine learning through Spark MLlib and deep integrations with machine learning […]

New Maven Dependency Resolution Algorithm

Posted by admin on February 27, 2023

Maven is widely used as a Java project build tool at eBay. As an essential component of Maven, maven-resolver resolves declared dependencies, calculates dependency graphs, mediates conflicts and forms the classpaths for compilation and deployment. This is the so-called dependency resolution process. One of the main impediments to fast iterations of software development was […]

New zoom freezing feature for Geohash plugin

Posted by admin on February 21, 2023

Geohash is an encoding system that assigns a unique identifier to each region on the planet, so every geohash cell corresponds to its own string of digits and letters. Geohash is also the name of a plugin built by Grab for the Java OpenStreetMap Editor (JOSM) tool, which comes in handy for those who […]
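To make the encoding idea concrete, here is a minimal sketch of the standard public geohash scheme: the longitude and latitude ranges are halved alternately, each halving contributes one bit, and every five bits become one base-32 character. This is an illustration only, not code from Grab's JOSM plugin; the function name `geohash_encode` and its parameters are hypothetical.

```python
# Base-32 alphabet used by geohash (digits plus letters, without a, i, l, o).
BASE32 = "0123456789bcdefghjkmnpqrstuvwxyz"


def geohash_encode(lat: float, lon: float, precision: int = 9) -> str:
    """Encode a latitude/longitude pair as a geohash string.

    Hypothetical helper for illustration; assumes -90 <= lat <= 90
    and -180 <= lon <= 180.
    """
    lat_lo, lat_hi = -90.0, 90.0
    lon_lo, lon_hi = -180.0, 180.0
    chars = []
    bits = 0          # bits accumulated for the current character
    bit_count = 0     # how many bits are in `bits` so far
    use_lon = True    # the first bit always refines longitude

    while len(chars) < precision:
        if use_lon:
            mid = (lon_lo + lon_hi) / 2
            if lon >= mid:
                bits = (bits << 1) | 1
                lon_lo = mid
            else:
                bits <<= 1
                lon_hi = mid
        else:
            mid = (lat_lo + lat_hi) / 2
            if lat >= mid:
                bits = (bits << 1) | 1
                lat_lo = mid
            else:
                bits <<= 1
                lat_hi = mid
        use_lon = not use_lon
        bit_count += 1
        if bit_count == 5:
            # Five bits select one character from the alphabet.
            chars.append(BASE32[bits])
            bits = 0
            bit_count = 0

    return "".join(chars)


if __name__ == "__main__":
    # A short hash names a large cell; longer hashes refine the same cell.
    print(geohash_encode(57.64911, 10.40744, precision=3))  # u4p
```

Because each extra character only subdivides the previous cell, a longer geohash for the same point always starts with the shorter one, which is what lets a tool treat the prefix as the enclosing region.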

Accelerate your model development with the new MLflow Experiments UI

Posted by admin on February 17, 2023

MLflow is the premier platform for model development and experimentation. Thousands of data scientists use MLflow Experiment Tracking every day to find the best candidate models through a powerful GUI-based experience which allows them to view, filter, and sort models based on parameters, performance metrics, and source information. Today, we are thrilled to announce several […]

Getting started with NLP using Hugging Face transformers pipelines

Posted by admin on February 6, 2023

Advances in Natural Language Processing (NLP) have unlocked unprecedented opportunities for businesses to get value out of their text data. Natural Language Processing can be used for a wide range of applications, including text summarization, named-entity recognition (e.g. people and places), sentiment classification, text classification, translation, and question answering. In many cases, you can get […]

Category: Engineering

