My name is Lukas Gienapp and I am a Research Assistant at the TEMIR group at Leipzig University. I am passionate about all things in Text Mining, Data Science, and Information Retrieval. I mainly work on deep learning approaches for text retrieval.
Publications
2023
- Theresa Elstner, Frank Loebe, Yamen Ajjour, Christopher Akiki, Alexander Bondarenko, Maik Fröbe, Lukas Gienapp, Nikolay Kolyada, Janis Mohr, Stephan Sandfuchs, Matti Wiegmann, Jörg Frochte, Nicola Ferro, Sven Hofmann, Benno Stein, Matthias Hagen, and Martin Potthast. Shared Tasks as Tutorials: A Methodical Approach. In Thirteenth AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI 23), 2023. EAAI.
2022
- Phillipp Sauer, Janek Bevendorff, Lukas Gienapp, Wolfgang Kircheis, Erik Körner, Benno Stein, and Martin Potthast. SMAuC – The Scientific Multi-Authorship Corpus. CoRR, abs/2211.02477, November 2022.
- Lukas Gienapp, Maik Fröbe, Matthias Hagen, and Martin Potthast. Sparse Pairwise Re-ranking with Pre-trained Transformers. In 2022 ACM SIGIR International Conference on the Theory of Information Retrieval (ICTIR ’22), July 2022. ACM.
- Christopher Akiki, Lukas Gienapp, and Martin Potthast. Tracking Discourse Influence in Darknet Forums. CoRR, abs/2202.02081, February 2022.
2021
- Lukas Gienapp, Wolfgang Kircheis, Bjarne Sievers, Benno Stein, and Martin Potthast. STEREO: Scientific Text Reuse in Open Access Publications. CoRR, abs/2112.11800, December 2021.
- Maik Fröbe, Matthias Hagen, Janek Bevendorff, Michael Völske, Benno Stein, Christopher Schröder, Robby Wagner, Lukas Gienapp, and Martin Potthast. The Impact of Main Content Extraction on Near-Duplicate Detection. In Andreas Wagner, Christian Guetl, Michael Granitzer, and Stefan Voigt, editors, 3rd International Symposium on Open Search Technology (OSSYM 2021), October 2021. International Open Search Symposium.
- Maik Fröbe, Janek Bevendorff, Lukas Gienapp, Michael Völske, Benno Stein, Martin Potthast, and Matthias Hagen. CopyCat: Near-Duplicates within and between the ClueWeb and the Common Crawl. In 44th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2021), July 2021. ACM.
- Lukas Gienapp. Quality-aware Argument Retrieval with Topical Clustering. In Working Notes Papers of the CLEF 2021 Evaluation Labs, September 2021.
- Alexander Bondarenko, Lukas Gienapp, Maik Fröbe, Meriem Beloucif, Yamen Ajjour, Alexander Panchenko, Chris Biemann, Benno Stein, Henning Wachsmuth, Martin Potthast, and Matthias Hagen. Overview of Touché 2021: Argument Retrieval. In Djoerd Hiemstra et al., editors, Advances in Information Retrieval. 43rd European Conference on IR Research (ECIR 2021), volume 12036 of Lecture Notes in Computer Science, March 2021. Springer.
- Lukas Gienapp, Clara Kruckenberg, and Manuel Burghardt. Topological properties of music collaboration networks: The case of Jazz and Hip Hop. DHQ: Digital Humanities Quarterly 15.1, 2021.
2020
- Lukas Gienapp, Maik Fröbe, Matthias Hagen, and Martin Potthast. The Impact of Negative Relevance Judgments on NDCG. In Mathieu d’Aquin et al., editors, 29th ACM International Conference on Information and Knowledge Management (CIKM 2020), pages 2037-2040, October 2020. ACM.
- Lukas Gienapp, Benno Stein, Matthias Hagen, and Martin Potthast. Estimating Topic Difficulty Using Normalized Discounted Cumulated Gain. In Mathieu d’Aquin et al., editors, 29th ACM International Conference on Information and Knowledge Management (CIKM 2020), pages 2033-2036, October 2020. ACM.
- Alexander Bondarenko, Maik Fröbe, Meriem Beloucif, Lukas Gienapp, Yamen Ajjour, Alexander Panchenko, Chris Biemann, Benno Stein, Henning Wachsmuth, Martin Potthast, and Matthias Hagen. Overview of Touché 2020: Argument Retrieval. In Linda Cappellato, Carsten Eickhoff, Nicola Ferro, and Aurélie Névéol, editors, Working Notes Papers of the CLEF 2020 Evaluation Labs, volume 2696 of CEUR Workshop Proceedings, September 2020.
- Lukas Gienapp, Benno Stein, Matthias Hagen, and Martin Potthast. Efficient Pairwise Annotation of Argument Quality. In 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), pages 5772-5781, July 2020. Association for Computational Linguistics.
2019
- Martin Potthast, Lukas Gienapp, Florian Euchner, Nick Heilenkötter, Nico Weidmann, Henning Wachsmuth, Benno Stein, and Matthias Hagen. Argument Search: Assessing Argument Relevance. In 42nd International ACM Conference on Research and Development in Information Retrieval (SIGIR 2019), July 2019. ACM.
Teaching Experience
I have taught seminars and lab sessions at both the bachelor's and master's level, covering topics in ML, NLP, and IR:
- Foundations of Machine Learning
- Big Data & Language Technologies
- Advanced Information Retrieval
- Information Retrieval
Professional Experience
since 2019, Text Mining & Retrieval Group, Leipzig University
Research Assistant
2017-2019, Institute for Sociology, Leipzig University
Student Assistant
2017-2019, Institute for Translatology, Leipzig University
Student Assistant
Educational Experience
2019-2022, Leipzig University
M.Sc. Data Science
2019-2022, Leipzig University
M.Sc. Digital Humanities
2016-2019, Leipzig University
B.Sc. Digital Humanities
2015-2016, Leipzig University
B.A. Linguistics
until 2014, Gymnasium Carolinum Bernburg
Awards & Grants
- SIGIR Student Travel Grant of the Special Interest Group on Information Retrieval (SIGIR) for the 29th ACM International Conference on Information and Knowledge Management (CIKM 2020) for the paper Estimating Topic Difficulty Using Normalized Discounted Cumulated Gain.
- SIGIR Student Travel Grant of the Special Interest Group on Information Retrieval (SIGIR) for the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2019) for the paper Argument Search: Assessing Argument Relevance.