Resources

Code, notebooks, and data I made openly available.

name	type	description

augtox	paper repo	All experiments described in this paper.
neutralrewriter	paper repo	All experiments described in this
reap	paper repo	All experiments described in this paper.
cacapo-dataset	data	Multilingual, multi-domain data-to-text generation dataset.
amica	paper repo	All experiments described in this paper.
dutch-embeddings	data	Various Dutch word embeddings trained with word2vec.
style-obfuscation	paper repo	All experiments described in this paper.
simple-queries	paper repo	All experiments and data described in this paper.
toku	demo	Author profiling based on Simple Queries (see above).
omesa	code (sci)	Small framework for reproducible Text Mining research.
topbox	code (sci)	Wrapper for Labelled Latent Dirichlet Allocation (L-LDA).
sakuin	code (tool)	Web directory indexing and file sharing with Python.
markdoc	code (tool)	Convert NumPy-styled Python docstring to Markdown.
ebacs	code (sci)	Minimalistic conference manager.
ec2latex	code (sci)	XML to LaTeX book of abstracts.
nancho	code (fun)	Mute music player sound on browser audio (Ubuntu).