Deep learning techniques are being applied to tabular data in financial services at Capital One.
Transformers and transfer learning are emerging opportunities for improving deep learning on tabular data.
Deep dives
Capital One's Approach to Machine Learning Research
Capital One's Senior Director, Bion Bruce, discusses the company's approach to machine learning research focusing on tabular data. They aim to distill relevant advancements in machine learning to solve financial service problems efficiently.
Key Focus Areas in Machine Learning Research
Capital One emphasizes graph machine learning, explainability, anomaly detection, and privacy as key research areas. Graph machine learning offers insights into financial networks, while explainability and anomaly detection enhance fraud protection.
Challenges and Opportunities in Adopting Deep Learning for Tabular Data
Transitioning to deep learning for tabular data presents challenges like encoding, regularization, and tooling. The need for user-friendly tools resembling scikit-learn and XGBoost is crucial for wider adoption.
Future Research Directions in Deep Learning for Tabular Data
The research frontier in deep learning for tabular data includes self-supervised pre-training with transformers, efficient encoding, regularization techniques, and understanding complex architecture for diverse data sets. Developing tooling and simplified APIs for easy adoption is crucial for performance parity with established methods.
Today we’re joined by Bayan Bruss, a Sr. director of applied ML research at Capital One. In our conversation with Bayan, we dig into his work in applying various deep learning techniques to tabular data, including taking advancements made in other areas like graph CNNs and other traditional graph mining algorithms and applying them to financial services applications. We discuss why despite a “flood” of innovation in the field, work on tabular data doesn’t elicit as much fanfare despite its broad use across businesses, Bayan’s experience with the difficulty of making deep learning work on tabular data, and what opportunities have been presented for the field with the emergence of multi-modality and transformer models. We also explore a pair of papers from Bayan’s team, focused on both transformers and transfer learning for tabular data.
The complete show notes for this episode can be found at twimlai.com/go/591
Get the Snipd podcast app
Unlock the knowledge in podcasts with the podcast player of the future.
AI-powered podcast player
Listen to all your favourite podcasts with AI-powered features
Discover highlights
Listen to the best highlights from the podcasts you love and dive into the full episode
Save any moment
Hear something you like? Tap your headphones to save it with AI-generated key takeaways
Share & Export
Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more
AI-powered podcast player
Listen to all your favourite podcasts with AI-powered features
Discover highlights
Listen to the best highlights from the podcasts you love and dive into the full episode