My wonderful colleagues Lucy and Kiran wrote a wonderful blog post about anti-patterns in Data Lake projects.
Productionizing machine learning/AI/data science is a challenge. Not only are the outputs of machine-learning algorithms often compiled artifacts that need to be incorporated into existing production services, the languages and techniques used to develop these models are usually very different than those used in building the actual service. In this post, I want to explore how the degrees of freedom in versioning machine learning systems poses a unique challenge. I'll identify four key axes on which machine learning systems have a notion of version, along with some brief recommendations for how to simplify this a bit. ...
A study of the daily lives of violent far-right actors using court records to peer inside their living spaces.
A people’s history of the removal of statues honoring racists, slave owners, and colonizers, largely during the Uprising in the Summer of 2020 following the murder of George Floyd.
I am an advisory board member of The Prosecution Project, a data-driven project studying the patterns of prosecution of extremism, terrorism, and political violence.
The opinions shared on this page do not necessarily reflect the views of any organization listed above.