Women In Data Science

  • Author: Vários
  • Narrator: Vários
  • Publisher: Podcast
  • Duration: 34:57:05
  • More information

Informações:

Synopsis

Hear from women leaders across the data science profession, as they share their advice, career highlights, and lessons learned along the way. This podcast is brought to you by the Stanford Institute for Computational & Mathematical Engineering (ICME) and the Stanford School of Engineering. Generous support for this podcast and other Women in Data Science initiatives has been provided by Intuit, Microsoft, SAP, Walmart Labs, and Western Digital.

Episodes

  • Leda Braga | Applying data science to investment strategies

    14/10/2022 Duration: 36min

    Leda Braga is the founder and CEO of Systematica Investments, a hedge fund that uses data science-driven models to support its investment strategies. Leda was born and raised in Brazil and found her way into the financial sector after getting her PhD in engineering and spending several years as an academic. Her financial career started with seven years in investment banking at JP Morgan and then she joined the hedge fund startup BlueCrest in 2000. She explains that while her funds did very well during the 2008 financial crisis, the time felt like an existential crisis because you didn’t know if the major investment banks were going to survive. But she said it was a formative time and she learned many lessons. Several years after the financial crisis, she spun off her own firm, Systematica Investments focused on systematic trading.Leda explains that systematic investment management is data science applied to investment. The systematic approach makes the investment process less reliant on the random nature of f

  • Jessica Bohórquez | Using AI for leak detection in water pipelines (Spanish)

    15/09/2022 Duration: 46min

    A Colombian engineer, Jessica is fascinated by the processes and complexity of water supply systems in urban areas.In her post doc research in Australia, she brings together her expertise on the water hammer and transient flow waves to create an AI model that is able to identify where pipeline defects are faster and more accurately than existing techniques.She explains that in data science, the most important stage is understanding the problem. You need to bring in basic knowledge of the problem and expertise from other disciplines that are involved in a problem and combine that with artificial intelligence. AI is an important tool but just part of the solution. It’s critical to maintain all the legacy of knowledge and understanding of a problem. AI can make it simpler to apply, but you can’t leave behind the physics or knowledge of the hydraulic part of water movement. Working in industry, she has found that it’s important to first understand how the system works. In these large companies in charge of delive

  • Karolina Urbanska | Using data science to study human behavior

    11/08/2022 Duration: 47min

    As a quantitative social psychologist, Karolina has always been interested in using data to measure human behavior to try to understand it better. She has researched questions around political attitudes and polarization, particularly in light of Brexit and Trump’s election in 2016. She wanted to understand how people could arrive at completely different understandings of the world and reflect it in their voting decisions. One of her findings was that in the American two-party political system, people tend to identify as either Republican or Democrat and are more likely to agree with statements from their identified party. People use identity cues as mental shortcuts to judge information because there’s simply too much information to decipher. She says the polarization is stronger in the US where there are just two major parties compared to other countries with more choice of multiple political parties.After her undergraduate and Ph.D. in psychology and two post-doctoral positions, Karolina decided to leave ac

  • Welcoming our new podcast co-host, Cindy Orozco

    15/06/2022 Duration: 51min

    EPISODE NOTESWiDS Executive Director Margot Gerritsen welcomes her new co-host, Cindy Orozco, in a wide-ranging conversation about their career paths and valuable learnings along the way. Cindy is thrilled to be joining as podcast co-host and believes that showcasing women at all stages of their careers shows that we “share the same fears or experiences every day. It's just that some of us have been on the path a little bit longer than others.” Cindy is an applied mathematician who is currently working as a machine learning solutions engineer at Cerebras Systems. Originally from Colombia, she loved applied math, and did a master's in civil engineering and mathematics from King Abdullah University of Science and Technology (KAUST), in Saudi Arabia, and a PhD in Computational and Mathematical Engineering from ICME at Stanford. She met Margot at Stanford and has been contributing to WiDS for many years at conferences, workshops and datathons.After answering some questions about herself, Cindy stepped right into

  • Tahu Kukutai | Advocating for indigenous data sovereignty

    09/12/2021 Duration: 38min

    When Tahu Kukutai’s father went to school, he wasn’t allowed to speak his native language. If he did, he would be hit by his teachers. While the situation for the Maori people in Aotearoa (the Maori name for New Zealand) has improved somewhat since then, Tahu has dedicated her career to advocating for the rights of Indigenous peoples to preserve their native language, identity, communities, and culture. In today’s world, power over data is a central component of indigenous self-determination. Government agencies decide what data gets collected on whom and how it gets used, shared, and stored. There's a long history of data colonialism and state surveillance of indigenous communities designed for government priorities. Indigenous data sovereignty provides a framework to determine what data are collected and how it’s used, the ethical framework and governance, and the intended beneficiaries. Indigenous peoples have a different perspective on data. Most western frameworks about data protection, rights, and priva

  • Allison Koenecke | Researching algorithmic fairness and causal inference in public health

    11/11/2021 Duration: 27min

    Allison Koenecke, who received her PhD from Stanford’s Institute for Computational and Mathematical Engineering (ICME), describes how her experiences in academia and industry shaped her decision to return to academia. Currently a postdoc at Microsoft Research in the Machine Learning and Statistics group, she starts as an Assistant Professor of Information Science at Cornell University next year. Her research interests lie at the intersection of economics and computer science, with projects focusing on fairness in algorithmic systems and causal inference in public health.​Allison says in her career so far, she has always tried to keep as many doors open as possible but recognized, at some point, you have to start closing doors and specialize. After getting her bachelor’s degree in mathematics from MIT, she worked in economic consulting for a few years and realized she wanted to do something with more social benefit. While she was working in industry and during summer internships, she kept in touch with profess

  • Karina Edmonds | Building bridges between business and academia

    07/10/2021 Duration: 38min

    Though Karina showed an early aptitude in math, her high school counselor advised her against pursuing an engineering degree. She ignored his advice and went on to earn her undergrad degree in mechanical engineering from the University of Rhode Island and a PhD in Aeronautics from Caltech. She landed her first job as a speech-to-text engineer at TRW where she was awarded her first patent. She then moved on to technology transfer as a patent agent at the Jet Propulsion Lab. She bounced back to academia, managing corporate partnerships for Caltech, and then returned to industry as Google’s University Lead for Google Cloud AI/Machine Learning. She is now at SAP as Global Head of Academies and University Alliances, continuing to connect industry and academia.In her diverse career spanning business and education, she has seen increasing power concentrated in big tech companies through their ownership of immense datasets and computational power. Companies are also attracting talent away from universities that are n

  • Fatima Abu Salem | Applying data science for the public good in Lebanon

    09/09/2021 Duration: 33min

    Fatima Abu Salem grew up in Lebanon and has focused her data science research on addressing critical challenges in the region, including problems around the Syrian refugee crisis, the water quality in Lebanon and their irrigation requirements for farmers. Fatima explains that her life journey is really enshrined in the conflict of the region. She was born to a Lebanese mother and a Palestinian father who had refugee status so she inherited this status. This meant there were many restrictions on what career she could pursue and her dream to become a doctor was out of reach. But she was good in math so majored in mathematics at the American University of Beirut and one of the only jobs available to her as a Palestinian was able to become a high school teacher. She taught for a few years but realized it wasn’t what she wanted to do with her life. At the time, she was working on her master’s thesis in Algorithmic Number Theory at the American University in Beirut and decided she wanted to pursue a PhD. She emaile

  • Louvere Walker-Hannon | Gaining skills and overcoming barriers to a career in data science

    12/08/2021 Duration: 48min

    Louvere Walker-Hannon has worked at MathWorks (the company that makes MATLAB) for over 21 years, where she’s also a STEM Ambassador. She studied biomedical engineering as an undergraduate at Boston University and did graduate work at Northeastern University in geographic information technology with a specialization in remote sensing.She loved working with MATLAB as an undergraduate and when MathWorks came to the career fair when she graduated, she sought them out, got an interview, and has been working there ever since.She says there are both technical and non-technical skills that are valuable in the field of data science. Technical skills include coding, some programming, a foundation in mathematics, some statistics, and in some cases physics. Non-technical skills are also very important. It’s critical to be able to communicate your findings clearly using a variety of techniques. She says stay away from technical jargon and communicate as if you’re having a conversation. A second important skill is active l

  • Menglin Cao | Data science in fintech and financial services

    15/07/2021 Duration: 29min

    After she earned her BA, MA, and PhD in economics from the University of Maryland, Menglin Cao spent six years at Fannie Mae before joining Wells Fargo. Over the past 15 years, she has seen a major shift in how financial institutions use data to drive business decisions. In the past, many business decisions were based upon the experience and judgement of senior executives, but today every decision must be backed up by data and analytics. Many aspects of the financial services that leverage data are great candidates for AI and ML models, such as customer experience, revenue generation, and risk management. She wants to ensure they build models that are fair, explainable, interpretable, and able to generate value.She offers several examples about best practices for success.To be useful, the data must first be focused on a foundational business question. “Data can tell many stories, but until you bring the story to the businesses and they can relate the story to a question, experience or concern that they have,

  • Karen Hao | Covering AI and Ethics Washing in the Tech Industry

    10/06/2021 Duration: 34min

    Karen Hao trained as a mechanical engineer and then joined a Silicon Valley startup, thinking that technology was the best means to create social change. While surrounded by smart people who were also passionate about using technology for social change, she soon discovered there were no incentives or pathways to accomplish this. “When you're inside a technology company and you're thinking this is going to help change the world, you're often blind to unattended consequences of your work,” she says.She decided to transition to a career in journalism where she could create social change by raising awareness about social impacts of technologies like AI, and how big tech companies engage in “ethics washing” to protect their profits. She is intrigued by the way that incentives shape the work that is done at a systemic level. She says every tech giant suffers from issues at the systemic level where there are people who deeply care about ethics within the organization, but it doesn't mean the company is willing to ch

  • Cecilia Aragon | Aerobatic Pilot, Author and Data Scientist

    06/05/2021 Duration: 32min

    The multi-talented Cecilia Aragon is a data scientist, professor, author and champion aerobatic pilot. In this podcast, she explains how learning to fly gave her the confidence to pursue her career in human-centered data science and as an author.Her book, Flying Free: My Victory Over Fear to Become the First Latina Pilot on the US Aerobatic Team, is the story of how a timid daughter of immigrants who had terrible phobias overcame her fears to become a champion pilot. Learning to fly and excelling at it helped her overcome emotional barriers from childhood when she was fearful and doubted her abilities.In her mid-20s, she was pursuing her PhD and felt a lack of confidence, so dropped out of the program. “It wasn't that I had failed life, but I was living a very narrow life. I was just saying no to everything that might be exciting or interesting. And I saw my life stretching in front of me as incredibly narrow. A colleague offered me a ride in a small airplane…It suddenly occurred to me that living life too sa

  • Kristian Lum | Applying Statistics to Promote Fairness and Transparency

    09/02/2021 Duration: 30min

    Kristian’s interest in statistics and algorithmic fairness has taken her on a winding career path from academia to business, to public service, and back to academia. As she has made different career changes, she didn’t decide between academia vs. industry vs. non-profit, it was more about the problem she was interested in working on at the moment, and what else is happening in her life. After she earned her PhD in Statistical Science from Duke University, she worked as a research professor at Virginia Tech where she did microsimulation and agent-based modelingin a simulation lab. After that, she tried a data visualization and analytics startup called DataPad that was quickly acquired. When she was thinking about her next step in her career, she wanted to do something with social impact.She was fascinated by the work of the Human Rights Data Analysis Group (HRDAG) that was applying statistical models to casualty data to estimate the number of undocumented conflict casualties. She spent a summer working for HRD

  • Lillian Carrasquillo | Using Human-Centric Data Science at Spotify

    03/12/2020 Duration: 38min

    A data scientist by training, Lillian brings a passion for human-centric machine learning and algorithmic effects to her role leading Spotify’s Personalized Home team. She also talks about the experience of leading her team from home during the pandemic.Lillian’s team of data scientists and user researchers collaborate with a mixed-methods research approach to try to discern user needs and create the best matches between listeners and creators. They observe and analyze user behaviors to develop product insights. Her team’s user researchers develop qualitative insights about user needs and then work with the data scientists to figure out the scale of this need or behavior. “What are users telling us they actually need in that moment by doing certain actions, by looking and exploring at certain recommended items?” She sees that Spotify can have a big impact on inspiring new experiences between listeners and the creators of music and podcasts. By sharing music and podcast stories with the world, Spotify often ma

  • Femke Vossepoel | Applying Data Assimilation Tools to COVID Forecasting Models

    27/10/2020 Duration: 39min

    After earning her PhD in Aerospace Engineering at Delft, Femke spent several years in oceanography, climate research, and subsurface modeling. She developed an expertise in data assimilation that she's now applying to improve COVID-19 pandemic forecasting models. Femke explains that data assimilation originated in weather forecasting, where a model is updated with the current day’s weather observations to provide a more accurate forecast for the next day. Data assimilation tools tune the model to provide a more accurate forecast. This concept can be applied in many areas including financial markets, the oil industry, and for COVID-19 research.To help improve COVID-19 forecasting, she is using a compartmental model where there are compartments for different groups: those susceptible to COVID-19, those exposed to it, those infected, those who recovered, those in quarantine, and those who are deceased. The model is like a set of boxes, and the transition from one box to the other is governed by an ordinary diffe

  • Francesca Dominici + Rachel Nethery | Using Data Science to Study Air Pollution Effect on COVID-19 Outcomes

    20/08/2020 Duration: 34min

    With the onset of the COVID-19 pandemic, Francesca Dominici and Rachel Nethery saw a way to connect the research they were doing on air pollution and health with the pandemic. They are studying the effects of air pollution exposure on different causes of hospitalization to see if pollution could increase a person’s vulnerability to COVID-19. While the research is at a preliminary stage, there is a lot of information that points towards the possibility that long-term exposure to air pollution could increase the mortality risk for COVID-19. Since this research is so important and timely, they have sometimes felt under pressure from the media and from high level government officials (from both parties) to answer questions with certainty about how many lives would be saved by reducing air pollution. They say it can be stressful to talk with certainty about their preliminary findings because the research is ongoing. But as a scientist, you should do your work with the utmost rigor and be able to communicate the un

  • Manisha Desai | The Importance of Data Integrity in COVID-19 Clinical Trials

    16/07/2020 Duration: 38min

    Manisha Desai is a professor of medicine (research) and of biomedical data science, and director of the Quantitative Sciences Unit at Stanford University. She is an expert in the design and analysis of clinical trials and epidemiologic studies across multiple diseases, including COVID-19. In this podcast, she provides some insights into the challenges and progress of COVID-19 clinical trials.When COVID happened, Manisha knew her team’s expertise in clinical trials would allow them to get up to speed quickly to study this new disease. So, she augmented her team with additional data science experts to focus on trials to identify safe and effective drugs to help prevent hospitalization and disease progression.“This whole period has been so anxiety-inducing for so many of us, it's actually been nice to focus that energy and anxiety towards work that could be part of the solution to some of the crisis,” she says. In studies so far, they have learned that there's a real disparity in outcomes by ethnicity and race a

  • Newsha Ajami | Improving Urban Water Systems Through Data Science, Public Policy and Engineering

    03/06/2020 Duration: 33min

    Newsha Ajami is a hydrologist specializing in sustainable water resource management, water policy, the water-energy-food nexus, and urban water strategy.When she was studying hydrology in grad school, she took a water policy class that changed the trajectory of her career. “I would say that was one of the most important events in my professional career. I realized that laws and policies are what change the way we manage resources,” she says. All the data optimization and modeling means nothing unless you can understand the policy layer imposed on how our natural systems operate. This interdisciplinary approach guides her research at Stanford’s Urban Water Policy group where she brings together expertise in hydrology, data science, engineering, public policy, human behavior and economics to improve urban water systems. Newsha explains that we’ve spent a lot of time focusing on building more capacity to meet increasing demand for water because our 20th century approach to water resource management has been very

  • Andrea Gagliano | The Intersection of Arts and Technology

    14/05/2020 Duration: 31min

    When Andrea was studying math as an undergrad, she was required to take an arts class in order to graduate, and soon discovered that she loved poetry. She learned that the process of writing a poem was often similar to solving a complex math problem—just starting with one part, and then doing one more, and gradually the rest is revealed. She enjoyed it so much that her first machine learning project in graduate school was on poetry/sonnet generation. Andrea wanted to blend technology and art in her career and Getty Images turned out to be the perfect place to combine her two interests. Getty Images curates and manages a huge library of images and videos that are used in editorial news, websites, social media, billboards and more. She started as a data scientist two years ago, and is now leading the AI/machine learning team to develop new tools to help clients more effectively use Getty’s creative assets. She explains that many of their creative clients come to the site and don't have the language to describe

  • Ya Xu | Using Data To Create Economic Opportunities For All Members Of Global Workforce

    15/04/2020 Duration: 40min

    Ya Xu manages LinkedIn’s global team of data scientists that manage data science projects across the company’s products, sales, marketing, economics, infrastructure, and operations. She says the company takes active responsibility over the data they collect to ensure fairness and protect privacy. They are very proactive about how they maintain their members’ trust, either with how they share the data externally or leverage the data to create opportunities.LinkedIn’s fairness mission is that two people with equal talent should have an equal shot at opportunities. To reinforce this, they constantly test new products to determine if a new feature introduces any unintended consequences that might impact fairness. For example, LinkedIn’s referral button allows a job applicant to see if someone they know works in the company and ask for a referral. The unintended consequence is this feature will benefit individuals who have a big network vs. the general population. She says they typically have about 500-600 experim

page 2 from 3