703: How Data Happened: A History, with Columbia Prof. Chris Wiggins
Aug 8, 2023
auto_awesome
Columbia Prof. Chris Wiggins discusses data's power dynamics, biology's transformation with data, and the New York Times data science team's innovative tech stack. Topics include humanities in data science, rearranging power through data, Bayes theorem controversy, data ethics, biology adapting to data science, and the NYT tech stack.
Humanities integration crucial in data science education.
Data science impacts power dynamics and ethics in society.
Biology adopts data science for transformative genetic insights.
Deep dives
Importance of Understanding Data History
Chris Wiggins details the history of data and statistics, highlighting its importance from centuries ago until the present. He emphasizes a lack of exposure to humanities among data scientists and discusses the controversial nature of Bayesian statistics, urging for a broader understanding to tackle current data science issues and advancements at institutions like Columbia University and The New York Times.
Bridging Humanistic and Quantitative Disciplines in Data Science Education
Wiggins addresses the absence of humanities in data science curricula, tracing historical specialization in education. He advocates for interdisciplinary education, combining technical skills with collaboration and strategic thinking, underscoring the significance of communication and ethical considerations in data science practice.
Power Dynamics and Ethical Challenges in Modern Data Science
Wiggins explores how data science reshapes power dynamics in society and the ethical implications of this shift. He discusses the complex interplay between ethics, corporate data use, and the perception of data as truth, shedding light on the need for critical thinking about data design choices and responsible data practices in the era of data-driven decision-making.
Applied Ethics Framework in Corporate Environments
In the podcast episode, the discussion revolves around the application of an ethics framework in corporate settings for data-driven algorithms. It emphasizes the importance of incorporating ethical principles like respect for persons, beneficence, and justice into the design and deployment of algorithms. The speakers stress the significance of reconciling ethical principles that may be in conflict and implementing processes such as ethical audits to ensure alignment with these principles. The episode underscores the need for organizational structures within companies to oversee ethical considerations for data-powered products, highlighting the dynamic nature of decision-making and monitoring in a corporate context.
Evolution of Data Science in Biological Research
Another focal point in the podcast is the evolution of data science in biological research, tracing the shift towards data-driven approaches over the past few decades. The conversation delves into the transformative impact of computational methods and AI on understanding biological systems, particularly highlighting the transition from traditional to data-intensive biology. The discussion touches on how advancements in sequencing technology have revolutionized genomics and enabled deeper insights into genotype-phenotype relationships. It also reflects on the changing perceptions within the biology community towards data analysis, emphasizing the role of collaboration between data scientists and biologists to leverage machine learning for biological research.
Statistics history, interdisciplinarity, and data and society. Chris Wiggins talks with Jon Krohn about the power dynamics of data, the transformation of the field of biology through data-driven approaches to genetic sequencing, and the New York Times’ data science team’s cutting-edge approach to accommodating its tech stack.
This episode is brought to you by the AWS Insiders Podcast and by Modelbit, for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
In this episode you will learn: • The importance of the humanities in data science [09:18] • How data science “rearranges” power [17:19] • An overview of How Data Happened [20:36] • The controversial nature of Bayes theorem [29:16] • Why we need to consider data ethics [34:00] • How biology came to adopt data science into its field [45:44] • The data science tech stack at the New York Times [49:18]