AINL 2020 - Program

AINL 2020: Conference Schedule

(Helsinki time)

Tutorial & Human-AI Workshop

Wednesday, October 7

10:00 – 10:15

Conference Opening

10:15 – 11:45

Tutorial. Finnish NLP in the Deep Learning Age

Filip Ginter; AI Scientist at Silo.AI and Assistant Professor at the University of Turku, TurkuNLP lab

In this workshop, we will review several of the available tools and resources for Finnish NLP. In particular, we will present the Turku Neural Parser Pipeline and the recent Turku NER system as primary examples of deep learning -based NLP tools for Finnish. We will also present numerous Finnish datasets and models, such as the Finnish BERT model FinBERT and their use. The workshop will contain several hands-on sessions during which the participants can test the tools and resources on their own in the Google colab environment.

11:45 – 12:00

Coffee Break

12:00 – 13:30

Tutorial. Finnish NLP in the Deep Learning Age

Filip Ginter; AI Scientist at Silo.AI and Assistant Professor at the University of Turku, TurkuNLP lab

Human-AI Workshop
See workshop talks abstracts here

14:30 – 14:50

Work Disability Risk Prediction with Machine Learning

Vili Huhta-Koivisto

14:50 – 15:05

Influence of Interactional Style on Affective Acceptance in Human-Chatbot Interaction. A Literature Review

David Dobrowsky, Lili Aunimo , Ilona Pezenka, Gerald Janous and Teresa Weber

15:05 – 15:20

Building a Chatbot: Architectures and Vectorization Methods

Anna Chizhik, Yulia Zherebtsova

15:20 – 15:35

Supporting the Inclusion of People with Asperger Syndrome: Building a Customizable Chatbot with Transfer Learning

Victoria Firsanova

15:35 – 15:50

Adopting AI-enhanced Chat for Personalizing Student Services in Higher Education

Asko Mononen, Ari Alamäki, Janne Kauttonen, Aarne Klemetti & Erkki Räsänen

Keynote & Paper Presentations

Thursday, October 8

10:00 – 11:30

Cross-Lingual Embeddings: a Babel Fish for Machine Learning Models

Marko Robnik-Šikonja, Professor of Computer Science and Informatics and Head of Artificial Intelligence Chair at the University of Ljubljana, Faculty of Computer and Information Science

Currently, the most successful machine learning methods are numeric, e.g., deep neural networks or SVMs. If we are to harness the power of successful numeric deep learning approaches for symbolic data such as texts or graphs, the symbolic data has to be embedded into a vector space, suitable for numeric algorithms. The embeddings shall preserve the information in the form of similarities and relations contained in the original data by encoding it into distances and directions in the numeric space. Typically, these vector representations are obtained with neural networks trained for the task of language modelling. As it turns out, the resulting numeric spaces are similar between different languages and can be mapped with approaches called cross-lingual embeddings.

We are going to present ideas of supervised, unsupervised, and semi-supervised cross-lingual embeddings. We will focus on recent contextual embeddings which assure that the same word is mapped to different vectors based on the context. We will describe how to build and fine-tune contextual embeddings, such as ELMo and BERT, and present examples of training a model in a well-resourced language such as English and transfer it to less-resourced language such as Finnish. We will describe applications of cross-lingual transfer in text classifiers and abstractive summarizers.

Paper Presentations

12:00 – 12:20

Sentiment Detection in Socio-Political Discussions on Russian Social Media

Olessia Koltsova, Svetlana Alexeeva, Sergei Pashakhin, Sergei Koltsov PolSentiLex

12:20 – 12:40

Automatic Detection of Hidden Communities in the Texts of Russian Social Network Corpus

Ivan Mamaev, Olga Mitrofanova

12:40 – 13:00

Dialog Modelling Experiments with Finnish One-to-One Chat Data

Janne Kauttonen, Lili Aunimo

13:05 – 13:25

An Explanation Method for Black-Box Machine Learning Survival Models Using the Chebyshev Distance

Lev Utkin, Maxim Kovalev, Ernest Kasimov

13:25 – 13:45

Unsupervised Neural Aspect Extraction with Related Terms

Timur Sokhin, Maria Khodorchenko, Nikolay Butakov

13:45 – 14:00

Advances of Transformer-Based Models for News Headline Generation

Alexey Bukhtiyarov, Ilya Gusev

14:00 – 14:20

Coffee Break

14:20 – 14:40

Predicting Eurovision Song Contest Results Using Sentiment Analysis

Iiro Kumpulainen, Eemil Praks, Tenho Korhonen, Anqi Ni, Ville Rissanen, Jouko Vankka

14:40 – 15:00

Improving Results on Russian Sentiment Datasets

Anton Golubev, Natalia Loukachevitch

15:00 – 15:20

Dataset for Automatic Summarization of Russian News

Ilya Gusev

15:20 – 15:35

Dataset for Evaluation of Mathematical Reasoning Abilities in Russian

Mikhail Nefedov

15:35 – 15:45

Coffee Break

15:45 – 16:05

Case Law Judgments by Using Other Judgments as a Query

Sami Sarsa, Eero Hyvönen Searching

16:05 – 16:25

Matching LIWC with Russian Thesauri: An Exploratory Study

Polina Panicheva, Tatiana Litvinova

16:25 – 16:45

Finding New Multiword Expressions for Existing Thesaurus

Petr Rossyaykin, Natalia Loukachevitch

16:45 – 17:00

GenPR: Generative PageRank Framework for Semi-Supervised Learning on Citation Graphs

Mikhail Kamalov, Konstantin Avrachenkov

Shared Task & Poster Presentations

Friday, October 9

10:00 – 10:20

The First Chinese-Russian Machine Translation Challenge

Valentin Malykh

10:20 – 10:40

ML Powered Tool for Parallel Texts Alignment

Sergey Averkiev

10:40 – 11:00

Challenges in Zh-Ru MT Task and How to Overcome Them

Yuriy Nazarov

Poster Presentations

11:15 – 11:30

Decentralized Learning for Text Mining

Kendrick Cetina Nuria García-Santa

11:30 – 11:45

Building Parallel Corpora Using Multilingual Sentence Embeddings

Sergei Averkiev

11:45 – 12:00

Publication Date Estimation for Scientific Papers

Andrey Grabovoy

Feel free to contact us at ainlevent@gmail.com

Cover photo from pixabay