Listed in reverse chronological order are all relevant academic activities since the start of my PhD position.
Collaboration paper submitted to EMNLP 2021.
Attended EACL 2021 (virtually).
Gave an Introduction to AI and ‘Introduction to Machine Learning’ guest lecture at the Tilburg Law School.
Supervised theses in Data Science: Anke Bodewes, Cas Goos, Cas van den Hurk, Leon Korbee, and Ronald van Os.
Taught the Spring semester of RS: Data Processing (880254).
Gave a talk about my work on Invasive Artificial Intelligence at the kick-off of the TAISIG Talks.
Adversarial Stylometry in the Wild: Transferable Lexical Substitution Attacks on Author Profiling accepted to EACL 2021 (2.5, 3.5, 4.5).
Taught the Fall semester of RS: Spatiotemportal Data Analysis (800880).
The CACAPO Dataset: A Multilingual, Multi-Domain Dataset for NeuralPipeline and End-to-End Data-to-Text Generation accepted to INLG 2020 and CONLL 2020 (retracted latter).
Collaboration paper submitted to INLG / CONLL 2020.
Submitted paper to EACL 2021.
Supervised theses in Data Science (described by general topic): Robbin Breeuwer — extending adversarial stylometry via lexical subsitution attacks, Max Knegt — debiasing racial hate speech using distantly supervision, Mert Torun — comparing polls with Twitter sentiment for the 2020 US elections.
Taught the Fall semester of Data Mining for Business and Governance (880022), and RS: Data Processing (880254).
Current Limitations in Cyberbullying Detection: on Evaluation Criteria, Reproducibility, and Data Scarcity’ published in Language Resources and Evaluation.
Paper rejected for EMNLP 2020 (1.5, 3.0, 3.5, 4.0).
Supervised theses in Data Science (described by general topic): Myrthe Reuver — predicting complaint labels for short clinical texts.
Submitted paper to EMNLP 2020.
Became a founding member of the Culture Committee at the Department of Cognitive Science & Artifical Intelligence, and Head of the PhD Culture Board. Started organizing weekly meetings with Lieke Gelderloos, Paris Mavromoustakos Blom, and George Aalbers
‘Current Limitations in Cyberbullying Detection: on Evaluation Criteria, Reproducibility, and Data Scarcity’ accepted to Language Resources and Evaluation with minor revisions.
‘Current Limitations in Cyberbullying Detection: on Evaluation Criteria, Reproducibility, and Data Scarcity’ available on arXiv.
Submitted paper to Language Resources and Evaluation.
Gave a guest lecture on Data Quality for the Tax & Technology course at the Vrije Universiteit.
Attended ATILA 2019.
Reviewed for CONLL 2019.
‘Towards Replication in Computational Cognitive Modeling: A Machine Learning Perspective’ accepted as commentary paper in Computational Brain & Behaviour.
Submitted commentary to Computational Brain & Behaviour.
Attended the Blackbox@NL workshop at JADS.
Gave a talk on (Adversarial) Computational Stylometry for our department’s colloquium series.
Gave an Introduction to AI guest lecture at the Tilburg Law School.
Attended ICT with Industry 2019 at the Lorentz Center in Leiden, predicting public importance of news for the Persgroep. My observations here.
Presented Style Obfuscation by Invarance at ATILA 2018.
‘Automatic detection of cyberbullying in social media text’ accepted in PLoS ONE.
Supervising theses in Data Science (described by general topic): Gytha Muller — Improving Cross-Domain Distant Profiling, Ruben van de Kerkhof — Stylometric Features in Sarcasm Detection.
Teaching Assistant (provided the Decision Tree lecture) for Machine Learning (880083).
Taught the Fall semester of Data Mining for Business and Governance (880022).
Invited lecturer at the Vrije Universiteit, providing the Data Quality lecture for the Tax & Technology course.
Part of the local organisation committee for BNAIC2018/BENELEARN2018.
‘Style Obfuscation by Invariance’ accepted for COLING 2018.
Supervised Master theses in Data Science (described by general topic): Joey Dokter — Predicting Customer Conversion for In-House Advertisements (in collaboration with OnMarc), Alejandra Hernández Réjon — Gender Differences in Cyberbullying Detection, Kostas Stoitas — Evaluating Word Embeddings for Cyberbullying Detection, Tzoulian Prodromos Ninas — Investigating Cyberbullying and Toxicity, Anwar Amezoug — Out of Domain Performance of Profiling, Coen van Duijnhoven — Robustness of Simple Queries on Political Preference Detection, Martijn Oele — Linguistically Informed Local Changes for Author Obfuscation, Jan de Rooij — Replicating Style Obfuscation by Invariance.
Blog shared by KDNuggets, again:
Euclidean vs. Cosine Distance— KDnuggets (@kdnuggets) March 23, 2018
"This post was written as a reply to a question asked in the Data Mining (https://t.co/oCLBFN8ELc) course: When to use the cosine similarity?" https://t.co/Wf3GjpWn0t pic.twitter.com/MOT7Pw2AE2
Submitted long paper to COLING 2018.
Part of the Scientific Committee of LREC 2018; reviewed for main conference and TA-COS workshop.
Presented ‘Attribute Obfuscation with Gradient Reversal’ at CLIN 28.
Taught the Spring semester of Data Mining for Business and Governance (880022).
- Blog shared by KDNuggets, twice:
Scikit-learn Pipeline Persistence and JSON Serialization Part II https://t.co/q0DdKRHR45 pic.twitter.com/WMQUqhsTf2— KDnuggets (@kdnuggets) December 27, 2017
Scikit-learn Pipeline Persistence and JSON Serialization https://t.co/eLJ5jiPxRd pic.twitter.com/EiimZV6zz8— KDnuggets (@kdnuggets) December 27, 2017
Presented ‘Distantly Supervised Prediction of Demographic Information on Social Media’ and tōku at the TiCC PhD Day 2017 and won the Best Demo Award.
Organized ATILA’17 in Tilburg.
Was part of the jury at the Xomnia Datathon against cyberbullying.
Taught Fall semester Data Mining for Business and Governance (880022).
Attended EMNLP 2017 in Copenhagen.
‘Simple Queries as Distant Labels for Detecting Gender on Twitter’ accepted for W-NUT 2017.
Organized informal NLP with Deep Learning summer school at Tilburg University.
Co-reviewed for CONLL 2017, and EMNLP 2017.
Submitted short paper to EMNLP 2017.
Co-reviewed for EACL 2017.
Supervised Master theses in Data Science: Evie Izeboud — Prediction of Job Transition Using Publicly Available Professional Profiles (in collaboration with 8vance), Thomas Rockx — Profile pictures as predictor for network size on Twitter, Vannesa Berhitoe — Multi-label emotion detection on Twitter.
Taught Spring semester Social Data Mining (880022).
Built a concept version for the CS&AI group’s webpage.
Attended ATILA 2016.
Started teaching Fall semester Text Mining (880091) and Social Data Mining (880022) in context of the new Data Science master. Materials for both courses are open-source.
Moved to the CS&AI group at Tilburg University as part Lecturer, part joint PhD candidate with CLiPS at the University of Antwerp under the supervision of Eric Postma, Grzegorz Chrupała, and Walter Daelemans.
Attended the 2nd Summer School on Integrating Vision & Language: Deep Learning (iV&L).
Gave a LaTeX tutorial for the course Computational Models of Language Understanding.
‘Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource’ accepted for LREC 2016.
Part of the Scientific Committee of LREC 2016; reviewed for main conference and TA-COS workshop.
Wrote a Python docstring to Markdown convertor (markdoc).
Made Omesa open-source.
Submitted ‘Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource’ to LREC.
Wrote an author profiling module and demo environment for AMiCA (profl).
Presented ‘Domain Adaptation of Simulated Data for Cyberbullying Research.’ and demonstrated ‘Shed: a Framework for Reproducible Text Mining’ at ATILA 2015.
Organized ATILA 2015 and built its website.
Wrote a Python wrapper for Stanford’s Topic Modelling Toolbox (topbox).
Participated in the Lisbon Machine Learning School (LxMLS).
Presented ‘Modelling Discussion Topics to Improve News Article Tagging.’ and demonstrated ‘Ebacs: a Minimalistic Conference Manager’ at DHBenelux.
Presented ‘Topic Modelling in Online Discussions’ at CLIN 2015 and accepted the STIL prize.
Accepted the Leo Coolen award at Tilburg University.
Made the book of abstracts for CLIN25 and developed ec2latex in the process.
Attended ATILA 2014.
Gave a guest lecture for Tilburg University’s course Social Intelligence about my research and applying Text Mining for business purposes.
Started as a PhD candidate on AMiCA project.