About me
I joined the the Research team at the Wikimedia Foundation on October 2018 as a Research Scientist. I currently live in the glorious New York City, NY, USA.
My work
My background is in geography and human-computer interaction, with a special focus on understanding (and trying to do something about) how structural inequalities find their way online and into algorithmic systems. Since joining WMF, I have also been heavily involved in research towards better understanding reader needs and behavior, how to model and make predictions about Wikimedia content in a language-agnostic manner, and the impact of external re-use of Wikimedia content.
Contact me
- email: isaac@wikimedia.org
- irc: isaacj
- phabricator:
- github:
Tools
A collection of tools that I've built (or helped build) for showcasing some of our research work:
And specifically, a number of Python packages:
- : parsing Wikipedia HTML (parsoid output)
- : structured analysis of wikitext diffs
- : word / sentence tokenization for Wikimedia content
- : parsing Wikimedia SQL dumps
Musings
Various writings about topics relevant to Wikimedia data, research, etc.
- Language modeling:
- Data practices:
Projects
Last updated on 11/16/2023
ActiveActive projects that I am currently working on: |
CompletedCompleted research projects and reports:
|
BackburnerProjects that I've started, but had to put down for the moment: |