

DataTalks.Club
DataTalks.Club
DataTalks.Club - the place to talk about data!
Episodes
Mentioned books

Mar 22, 2024 • 58min
Building Production Search Systems - Daniel Svonava
Links:
VectorHub: https://superlinked.com/vectorhub/?utm_source=community&utm_medium=podcast&utm_campaign=datatalks
Daniel's LinkedIn: https://www.linkedin.com/in/svonava/
Free Data Engineering course: https://github.com/DataTalksClub/data-engineering-zoomcamp
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html
This podcast is sponsored by VectorHub, a free open-source learning community for all things vector embeddings and information retrieval systems.

Mar 16, 2024 • 57min
Building Machine Learning Products - Reem Mahmoud
We talked about:
Reem’s background
Context-aware sensing and transfer learning
Shifting focus from PhD to industry
Reem’s experience with startups and dealing with prejudices towards PhDs
AI interviewing solution
How candidates react to getting interviewed by an AI avatar
End-to-end overview of a machine learning project
The pitfalls of using LLMs in your process
Mitigating biases
Addressing specific requirements for specific roles
Reem’s resource recommendations
Links:
LinkedIn: https://www.linkedin.com/in/reemmahmoud/recent-activity/all/
Website: https://topmate.io/reem_mahmoud
Free Data Engineering course: https://github.com/DataTalksClub/data-engineering-zoomcamp
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html

Feb 23, 2024 • 56min
Make an Impact Through Volunteering Open Source Work - Sara EL-ATEIF
We talked about:
Sara’s background
On being a Google PhD fellow
Sara’s volunteer work
Finding AI volunteer work
Sara’s Fruit Punch challenge
How to take part in AI challenges
AI Wonder Girls
Hackathons
Things people often miss in AI projects and hackathons
Getting creative
Fostering your social media
Tips on applying for volunteer projects
Why it’s worth doing volunteer projects
Opportunities for data engineers and students
Sara’s newsletter suggestions
Links:
Dev and AI hackathons: https://devpost.com/
Healthcare-focused challenges: https://grand-challenge.org/challenges/
Volunteering in projects (AI4Good): https://www.fruitpunch.ai/
Volunteering in projects (AI4Good) 2: https://www.omdena.com/
Twitter: https://twitter.com/el_ateifSara
Instagram: https://www.instagram.com/saraelateif/
LinkedIn: https://www.linkedin.com/in/sara-el-ateif/
Youtube: www.youtube.com/@elateifsara
Free Data Engineering course: https://github.com/DataTalksClub/data-engineering-zoomcamp
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html

Feb 2, 2024 • 53min
Accelerating The Job Hunt for The Perfect Job in Tech - Sarah Mestiri
Sarah Mestiri, a data scientist and career coach, discusses supporting women in the data field, narrowing down focus when job hunting in tech, building relationships through informational interviews, researching companies and job requirements, applying for data engineering roles while taking a course, the value of sharing and engaging in the learning process, and discovering skills and learning resources.

Jan 31, 2024 • 53min
Machine Learning Engineering in Finance - Nemanja Radojkovic
We talked about:
Nemanja’s background
When Nemanja first work as a data person
Typical problems that ML Ops folks solve in the financial sector
What Nemanja currently does as an ML Engineer
The obstacle of implementing new things in financial sector companies
Going through the hurdles of DevOps
Working with an on-premises cluster
“ML Ops on a Shoestring” (You don’t need fancy stuff to start w/ ML Ops)
Tactical solutions
Platform work and code work
Programming and soft skills needed to be an ML Engineer
The challenges of transitioning from and electrical engineering and sales to ML Ops
The ML Ops tech stack for beginners
Working on projects to determine which skills you need
Links:
LinkedIn: https://www.linkedin.com/in/radojkovic/
Free Data Engineering course: https://github.com/DataTalksClub/data-engineering-zoomcamp
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html

Jan 24, 2024 • 56min
Stock Market Analysis with Python and Machine Learning - Ivan Brigida
We talked about:
Ivan’s background
How Ivan became interested in investing
Getting financial data to run simulations
Open, High, Low, Close, Volume
Risk management strategy
Testing your trading strategies
Sticking to your strategy
Important metrics and remembering about trading fees
Important features
Deployment
How DataTalks.Club courses helped Ivan
Ivan’s site and course sign-up
Links:
Exploring Finance APIs: https://pythoninvest.com/long-read/exploring-finance-apis
Python Invest Blog Articles: https://pythoninvest.com/blog
Free ML Engineering course: http://mlzoomcamp.com
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html

Jan 22, 2024 • 54min
Bayesian Modeling and Probabilistic Programming - Rob Zinkov
We talked about:
Rob’s background
Going from software engineering to Bayesian modeling
Frequentist vs Bayesian modeling approach
About integrals
Probabilistic programming and samplers
MCMC and Hakaru
Language vs library
Encoding dependencies and relationships into a model
Stan, HMC (Hamiltonian Monte Carlo) , and NUTS
Sources for learning about Bayesian modeling
Reaching out to Rob
Links:
Book 1: https://bayesiancomputationbook.com/welcome.html
Book/Course: https://xcelab.net/rm/statistical-rethinking/
Free ML Engineering course: http://mlzoomcamp.com
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html

Dec 27, 2023 • 57min
Navigating Challenges and Innovations in Search Technologies - Atita Arora
We talked about:
Atita’s background
How NLP relates to search
Atita’s experience with Lucidworks and OpenSource Connections
Atita’s experience with Qdrant and vector databases
Utilizing vector search
Major changes to search Atita has noticed throughout her career
RAG (Retrieval-Augmented Generation)
Building a chatbot out of transcripts with LLMs
Ingesting the data and evaluating the results
Keeping humans in the loop
Application of vector databases for machine learning
Collaborative filtering
Atita’s resource recommendations
Links:
LinkedIn: https://www.linkedin.com/in/atitaarora/
Twitter: https://x.com/atitaarora
Github: https://github.com/atarora
Human-in-the-Loop Machine Learning: https://www.manning.com/books/human-in-the-loop-machine-learning
Relevant Search: https://www.manning.com/books/relevant-search
Let's learn about Vectors: https://hub.superlinked.com/
Langchain: https://python.langchain.com/docs/get_started/introduction
Qdrant blog: https://blog.qdrant.tech/
OpenSource Connections Blog: https://opensourceconnections.com/blog/
Free ML Engineering course: http://mlzoomcamp.com
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html

4 snips
Dec 19, 2023 • 56min
The Entrepreneurship Journey: From Freelancing to Starting a Company - Adrian Brudaru
We talked about:
Adrian’s background
The benefits of freelancing
Having an agency vs freelancing
What let Adrian switch over from freelancing
The conception of DLT (Growth Full Stack)
The investment required to start a company
Growth through the provision of services
Growth through teaching (product-market fit)
Moving on to creating docs
Adrian’s current role
Strategic partnerships and community growth through DocDB
Plans for the future of DLT
DLT vs Airbyte vs Fivetran
Adrian’s resource recommendations
Links:
Adrian's LinkedIn: https://www.linkedin.com/in/data-team/
Twitter: https://twitter.com/dlt_library
Github: https://github.com/dlt-hub/dlt
Website: https://dlthub.com/docs/intro
Free ML Engineering course: http://mlzoomcamp.com
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html

Dec 17, 2023 • 55min
Become a Data Freelancer - Dimitri Visnadi
We talked about:
Dimitri’s background
The first steps of transitioning into freelance
Working with recruiters (contracting)
Deciding on what to charge for your services
Establishing your network
Self-marketing
Contracting vs freelancing
Which channel is better for those starting out?
Cutting out the middleman
Where to look for clients and how to vet them
The different way of getting into freelancing
Going back to a full-time job after freelancing
Common mistakes freelancers make
Dimitri’s resource suggestions
Reaching out to Dimitri
Links:
LinkedIn profile: http://www.linkedin.com/in/visnadi
The DataFreelancer website: https://thedatafreelancer.com/
Free ML Engineering course: http://mlzoomcamp.com
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html