Synopsis
Kirill Eremenko is a Data Science coach and lifestyle entrepreneur. The goal of the Super Data Science podcast is to bring you the most inspiring Data Scientists and Analysts from around the world to help you build a successful career in Data Science. Data is growing exponentially, and so are the salaries of those who work in analytics. This podcast can help you learn how to skyrocket your analytics career. It covers Big Data, visualization, predictive modeling, forecasting, analysis, business processes, statistics, R, Python, SQL programming, Tableau, machine learning, Hadoop, databases, data science MBAs, and all the analytics tools and skills that will help you better understand how to crush it in Data Science.
Episodes
-
637: How to Influence Others with Your Data
20/12/2022 Duration: 01h07min It's all about data visualization this week as Jon Krohn welcomes Ann K. Emery, data visualization designer and owner of Depict Data Studio, to the show. If you want to learn data viz best practices, tips and tricks and reporting how-tos, make some time to tune in today! This episode is brought to you by Kolena (kolena.io), the testing platform for machine learning. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • What data storytelling is [3:40] • Pinpoints of data visualization [10:38] • Best practices for data visualization [23:41] • Surprising spreadsheet tricks [30:51] • When static dashboards are more effective than interactive ones [43:30] • Ann's top tips for presenting data in a slideshow [48:07] Additional materials: www.superdatascience.com/637
-
636: The Equality Machine
16/12/2022 Duration: 22min Digital literacy and data bias: Can one reduce or even eradicate the other? Law professor Orly Lobel speaks with SDS host Jon Krohn about Orly’s latest book, The Equality Machine, which offers an optimistic look into the future of AI and data mining. Additional materials: www.superdatascience.com/636 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
-
635: The Perils of Manually Labeling Data for Machine Learning Models
13/12/2022 Duration: 01h18min Hand labeling data and information bias: Jon Krohn speaks with Watchful CEO Shayan Mohanty about the pitfalls of data analysis when bias comes into the equation (spoiler alert: it always does), the importance of the Chomsky hierarchy in data management, and the importance of simulation engines for returning real-time results to users. This episode is brought to you by Iterative (iterative.ai), your mission control center for machine learning. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • Why bias in general is good [04:06] • The arguments against hand labeling [09:47] • How Shayan solves the problem of labeling at his company [24:26] • Misconceptions concerning hand-labeled data [43:25] • What the Chomsky hierarchy is [52:38] • Watchful’s high-performance simulation engine [1:04:51] • What Shayan looks for in his new hires [1:08:15] Additional materials: www.superdatascience.com/635
-
634: Model Error Analysis
09/12/2022 Duration: 06min Data scientist and author Serg Masís joins Jon Krohn for a Five-Minute Friday episode that touches on model error analysis. Learn how this process can improve your models and discover a helpful tool that expedites this critical process. Additional materials: www.superdatascience.com/634 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
-
633: Responsible Decentralized Intelligence
06/12/2022 Duration: 53min This week's episode is all about Responsible Decentralized Intelligence as award-winning professor and tech entrepreneur, Dawn Song, joins Jon Krohn to help us explore this exciting topic in-depth. This episode is brought to you by Iterative (iterative.ai), your mission control center for machine learning. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • What is decentralized intelligence? [3:46] • Dawn’s Responsible Data Economy collaboration with Meta AI [11:31] • How homomorphic encryption, differential privacy, and multi-party computation can work together [16:22] • How PrivateSQL makes differential privacy easy to use [22:54] • The relationship between deep learning and federated learning [37:55] • What is a responsible data economy [42:13] Additional materials: www.superdatascience.com/633
-
632: Liquid Neural Networks
02/12/2022 Duration: 10min Liquid neural networks are a type of bio-inspired machine learning set to make a huge impact in the field of data analytics. On this week’s Five-Minute Friday, Jon Krohn speaks with Pathway.com Co-Founder Dr. Adrian Kosowski about the development of this new type of network and what this means for the future of data. Additional materials: www.superdatascience.com/630 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
-
631: Data Analytics Career Orientation
29/11/2022 Duration: 58min Interview success, funny memes about data, and stakeholder management: Jon Krohn speaks with Luke Barousse, a full-time YouTuber who produces content to help aspiring data scientists. First, Jon and his guest go underwater to find out how data science can help you while working on a submarine before they emerge onto Luke’s YouTube channel. There, he discloses all the helpful hacks for data science beginners—with a generous helping of humor! As founder of MacroFit, a data-driven company that helps with meal planning, Luke is no stranger to portion sizes… This episode is brought to you by Iterative (iterative.ai), your mission control center for machine learning. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • Where Luke gets his inspiration for making YouTube videos [04:46] • How Luke got into creating comedy skits [08:21] • Luke’s favorite Python libraries for web scraping [14:41] • Incorrect assumptions that as
-
630: Resilient Machine Learning
25/11/2022 Duration: 06min Jon Krohn sits with Dr. Dan Shiebler at the Open Data Science Conference (ODSC) to dive into the critical components of building resilient machine learning. Additional materials: www.superdatascience.com/630 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
-
629: Software for Efficient Data Science
22/11/2022 Duration: 01h11min Has the term developer advocacy ever left you scratching your head? This week data science developer advocate for JetBrains, Dr. Jodie Burchell, joins Jon Krohn to shed light on her responsibilities and why it's a role you might want to consider. Jodie also dives into building reproducible data science workflows and the keys to working effectively with real-world data. This episode is brought to you by Iterative (iterative.ai), the open-source company behind DVC. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • Jodie’s background in psychology [2:22] • Jodie's tips for real-world data preparation [6:55] • Tour JetBrains' developer tools: PyCharm, DataSpell and Datalore [10:41] • What is a data science developer advocate? [38:47] • The books that Jodie's co-authored [46:18] • Jodie's favorite Python libraries [58:33] • How to have reproducible data science workflows [1:01:36] Additional materials: www.superdatascience.c
-
628: The Critical Human Element of Successful A.I. Deployments
18/11/2022 Duration: 05min On this episode of Five-Minute Friday, Jon Krohn speaks from the Open Data Science Conference (ODSC). There, he sits down with author and data scientist Keith McCormick to discuss the conference’s key trend: learning the importance of trust in the relationship between humans and algorithms. Additional materials: www.superdatascience.com/628 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
-
627: AutoML: Automated Machine Learning
15/11/2022 Duration: 01h30min Jon Krohn speaks with Erin LeDell, H2O.ai’s Chief Machine Learning Scientist. They investigate how AutoML supercharges the data science process, the importance of admissible machine learning for an equitable data-driven future, and what Erin’s group Women in Machine Learning & Data Science is doing to increase inclusivity and representation in the field. This episode is brought to you by Datalore (datalore.online/SDS), the collaborative data science platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • The H2O AutoML platform Erin developed [07:43] • How genetic algorithms work [19:17] • Why you should consider using AutoML [28:15] • The “No Free Lunch Theorem” [33:45] • What Admissible Machine Learning is [37:59] • What motivated Erin to found R-Ladies Global and Women in Machine Learning and Data Science [47:00] • How to address bias in datasets [57:03] Additional materials: www.superdatascience.com/627
-
626: Subword Tokenization with Byte-Pair Encoding
11/11/2022 Duration: 06min Word tokenization, character tokenization and subword tokenization go head-to-head this week as Jon Krohn delivers a mini-bootcamp on the NLP-related process. Additional materials: www.superdatascience.com/626 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
-
625: Analyzing Blockchain Data and Cryptocurrencies
08/11/2022 Duration: 01h04min Chainalysis' Director of Research, Kim Grauer, joins Jon Krohn to explore the state of economic-data analysis on the blockchain. This episode is brought to you by Datalore (datalore.online/SDS), the collaborative data science platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • Kim's role as Director of Research [5:02] • The unique real-time economic-data analytics of the blockchain [13:07] • How ML can predict patterns of criminal activity on the blockchain [18:56] • Interesting use cases of ML for crime investigation [29:37] • The tools and approaches Kim uses daily [47:44] • The future of crypto, blockchains, and data science [50:54] • Why a data science bootcamp helps people break into data science [53:42] Additional materials: www.superdatascience.com/625
-
624: Imagen Video: Incredible Text-to-Video Generation
04/11/2022 Duration: 07min On this week’s Five-Minute Friday, Jon Krohn investigates Imagen Video, Google’s latest model for making video art out of text prompts. Recently published, this text-to-video converter now competes against strong models already on the scene like DALL-E 2. Unlike DALL-E 2, it returns moving images or time-based media. Tune in to hear Jon explain the technology that made Imagen Video the tech giant’s shiniest new tool to date. Additional materials: www.superdatascience.com/624 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
-
623: Data Analyst, Data Scientist, and Data Engineer Career Paths
01/11/2022 Duration: 01h11min Jon Krohn speaks with Shashank Kalanithi, the man who makes a sport out of YouTube and data analytics out of sports. Listen in as he talks about how he got started producing YouTube videos on data science, the essential differences between data science roles, and how data could shape the future of the sports industry. This episode is brought to you by Datalore (datalore.online/SDS), the collaborative data science platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • What motivated Shashank to start his YouTube channel [04:31] • The must-have technical skills for every data scientist [16:59] • The soft skills needed for data science [20:52] • The differences between data analyst, data scientist and data engineer [24:26] • How data are currently being applied in the sports industry [38:38] • The “needs” divide between digital native and traditional companies [45:34] Additional materials: www.superdatascience.com
-
622: Burnout: Causes and Solutions
28/10/2022 Duration: 24min Is burnout on the horizon for you and your team? Christina Maslach, author of the new book "The Burnout Challenge," joins Jon Krohn to help us identify the common signs of looming burnout while steering us in a healthier direction. Additional materials: www.superdatascience.com/622 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
-
621: Blockchains and Cryptocurrencies: Analytics and Data Applications
25/10/2022 Duration: 01h11min Cryptocurrency and blockchain take center stage this week as we welcome Chief Economist at Chainalysis, Philip Gradwell, to discuss the data science applications in this exciting field. This episode is brought to you by Datalore (datalore.online/SDS), the collaborative data science platform, by Zencastr (zen.ai/sds), the easiest way to make high-quality podcasts, and by Bunch (superdatascience.com/bunch), the AI driven leadership coach. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • What the role of a chief economist entails [5:50] • What are blockchains and cryptocurrency? [8:23] • How analyzing cryptocurrencies differs from established fiat currencies [12:48] • Philip's work at Chainalysis [26:07] • Philip's crypto data analytics pipeline [34:48] • How Philip develops data products for a wide range of users [46:18] • How the blockchain facilitates innovative computing and machine learning technologies [51:52] • W
-
620: OpenAI Whisper: General-Purpose Speech Recognition
21/10/2022 Duration: 06min What’s your secret to superb audio recognition? Whisper it. We mean that literally—Whisper is the latest in OpenAI’s growing suite of models aimed to benefit humanity. On this episode of Five-Minute Friday, host Jon Krohn reviews OpenAI’s latest model, Whisper. This tool will vastly improve the way human speech is recognized and converted to text. Jon gets under the hood to show how the team managed to get such a powerfully accurate recognition model. Listen to the episode and find out how you can try it yourself, for free! Additional materials: www.superdatascience.com/620 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
-
619: Tools for Deploying Data Models into Production
18/10/2022 Duration: 01h20min Jon Krohn speaks with Erik Bernhardsson, the man who invented Spotify’s original music recommendation system. They address the different ways to interview a data science candidate, how to deploy a data model into the cloud, and the approach he took that made Spotify go from a digital music startup to an AI-driven streaming giant. This episode is brought to you by Datalore (datalore.online/SDS), the collaborative data science platform, by Zencastr (zen.ai/sds), the easiest way to make high-quality podcasts, and by Bunch (superdatascience.com/bunch), the AI driven leadership coach. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • The data problem that Erik’s company Modal Labs solves [04:32] • Erik’s prolific blogging career [09:15] • Opportunities for making data teams more efficient and productive [14:42] • Erik’s views on interviewing data scientists and software developers [20:18] • Erik’s tips and tricks for da
-
618: The Joy of Atelic Activities
14/10/2022 Duration: 03min Telic and atelic activities take center stage this week as Jon Krohn contemplates how our daily actions contribute to our overall sense of fulfillment. Additional materials: www.superdatascience.com/618 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.