Hackathon
October 17, 2019
Session leader: Annika Rockenberger
Topic: Preparing something cool for the Carpentries Tagathon in Week 44
Hackathon - Tagathon #
The Carpentries are inviting to a community initative in week 44 called “Hacktoberfest”. It is about tagging the many blog posts on the three Carpentries websites: Software Carpentry, Data Carpentry, and Library Carpentry. The aim is to make the posts more findable by assigning topics.
Combining the skills taught in The Carpentries with the power of Digital Humanities, this StudyGroup will attempt to create a set of tags for the blog posts based on computational text analysis. How does one read 1 million blog posts? Not at all! A machine can do this much better (and faster)!
We will use pandoc
to convert the blog posts written in markdown
to a format that the online topic explorer VoyantTools can ingest and see what it reveals about the content of the blogs. We will then take a look at the tag set that the Carpentries has prepared and compare the two. The Final product could be a semi-automatic tagger based on text analysis.
In order to participate, you don’t need to know any of the software and tools we are going to use. You should, however, bring your laptop and your curiosity and enthusiasm to experiment!