PyTorch’s Next Generation of Data Tooling
An overview and lookahead of our data efforts within PyTorch, including our new API extension points to support state-of-the-art ML data processing in both research and production. TorchData, an extensible library for constructing data loading …
Workshop: Building Real-Time ML Features with Feast, Spark, Redis, and Kafka
This workshop will focus on the core concepts underlying Feast, the open source feature store. We’ll explain how Feast integrates with underlying data infrastructure including Spark, Redis, and Kafka, to provide an interface between models and …
Empowering Small Businesses with the Power of Tech, Data, and Machine Learning
Data and machine learning shape Faire’s marketplace – and as a company that serves small business owners, our primary goal is to increase sales for both brands and retailers using our platform. During this session, we’ll discuss the machine …
Weaver: CashApp’s Real Time ML Ranking System
In this session, we will talk about one of the core infrastructure systems to personalize the experience on the CashApp, Weaver, and the work we did to scale it. Weaver is our real-time ML ranking system to rank items for search and recommendation …
Feature Engineering at Scale with Dagger and Feast
Dagger or Data Aggregator is an easy-to-use, configuration over code, cloud-native framework built on top of Apache Flink for stateful processing of real-time streaming data. With Dagger, you don’t need to write custom applications or manage …
Streamlining NLP Model Creation and Inference
At Primer we deliver applications with cutting-edge NLP models to surface actionable information from vast stores of unstructured text. The size of these models and our applications’ latency requirements create an operational challenge of deploying …
Is Production RL at a tipping point?
Reinforcement Learning has historically not been as widely adopted in production as other learning approaches (particularly supervised learning), despite being capable of addressing a broader set of problems. But we are now seeing an exponential …
Declarative Machine Learning Systems: Ludwig & Predibase
Declarative Machine Learning Systems are a new trend that marries the flexibility of DIY machine learning infrastructure and the simplicity of AutoML solutions. In this talk we will discuss about Ludwig, the open source declarative deep learning …
Accelerating Model Deployment Velocity
All ML teams need to be able to translate offline gains to online performance. Deploying ML models to production is hard. Making sure that those models stay fresh and performant can be even harder. In this talk, we will cover the value of regularly …
How to Draw an Owl and Build Effective ML Stacks
They’re handing us an engine, transmission, breaks, and chassis and asking us to build a fast, safe, and reliable car,” a data scientist at a recently IPO’ed tech company opined, while describing the challenges he faces in delivering ML …
Why is Machine Learning Hard?
Each of us has a different answer for “why is machine learning so hard.” And how long you have been working on ML will drastically influence your answer. I’ll share what I learned over the past 20 years, implementing everything from scratch …
Compass: Composable and Scalable Signals Engineering
Abnormal Security identifies and blocks advanced social engineering attacks in an ever-changing threat landscape, and so rapid feature development is of paramount importance for staying ahead of attackers. As we’ve scaled our machine learning system …