Projects in REVISE

ChARM: Chat Control, Age Verification and Resilience for Minors

The project explores the technical protection of minors on the internet and the challenges of detecting illegal content and behavior such as CSAM and cyber grooming, particularly in the context of the planned EU regulations. The debate around measures such as scanning online communication raises questions about privacy and about the accuracy of the technologies involved, which still exhibit high error rates. ATHENE provides an environment for developing and evaluating technological solutions that improve the protection of minors and comply with EU requirements. In the ChARM project, the state of the art is analyzed, demonstrators are built, and new protection methods are developed to better inform policymakers, businesses, and society.
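
The accuracy question can be made concrete with a back-of-the-envelope calculation. The sketch below uses purely illustrative numbers, not project findings, and applies Bayes' rule to show why even a seemingly accurate scanner produces mostly false alarms when the targeted content is rare in the scanned stream:

    # Illustrative base-rate arithmetic (all numbers are assumptions, not
    # project results): when illegal content is rare, false positives from
    # scanning benign messages swamp the true detections.

    def positive_predictive_value(sensitivity: float, specificity: float,
                                  prevalence: float) -> float:
        """Probability that a flagged message is actually illegal (Bayes' rule)."""
        true_pos = sensitivity * prevalence
        false_pos = (1.0 - specificity) * (1.0 - prevalence)
        return true_pos / (true_pos + false_pos)

    # Hypothetical scanner: catches 99% of illegal content, wrongly flags 1%
    # of benign messages; assume 1 in 100,000 messages is actually illegal.
    ppv = positive_predictive_value(sensitivity=0.99, specificity=0.99,
                                    prevalence=1e-5)
    print(f"Share of flags that are correct: {ppv:.4%}")
    # ~0.1%, i.e. on the order of a thousand false alarms per true hit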


CRISIS: Cross-Domain Disinformation Analysis

Nowadays, news is increasingly spread and consumed via social media. Since most posts do not undergo any verification before publication, there is a substantial risk that they contain false information. The CRISIS project aims to examine social media posts for disinformation, where information appears in the form of text, images, videos, and audio recordings. Various machine learning methods are to be employed in order to

  1. trace the dissemination paths of (dis)information and identify themes and trends (Social Media Analytics),
  2. recognize maliciously "recycled" content, trace it back to its original source, and match circulating information with previously conducted fact-checks (Semantic Similarity Analysis; see the sketch after this list), and
  3. support the manual fact-checking process by preselecting media, or specific sections of media, that are particularly worth checking (Check-Worthiness Analysis).
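
As a minimal sketch of the semantic similarity step (item 2), the snippet below matches a claim against an archive of fact-checks using sentence embeddings. It assumes the sentence-transformers library and a generic pretrained model; the project's actual models, data, and thresholds are not specified here.

    # Minimal semantic similarity matching with sentence embeddings
    # (toy claim and archive; model choice is an assumption).
    from sentence_transformers import SentenceTransformer, util

    model = SentenceTransformer("all-MiniLM-L6-v2")

    claim = "5G towers are secretly spreading a virus."
    fact_checks = [
        "Fact-check: there is no link between 5G radiation and viral infections.",
        "Fact-check: the moon landing footage is authentic.",
    ]

    # Embed the claim and the fact-check archive, then rank by cosine similarity.
    claim_emb = model.encode(claim, convert_to_tensor=True)
    fc_embs = model.encode(fact_checks, convert_to_tensor=True)
    scores = util.cos_sim(claim_emb, fc_embs)[0]

    best = int(scores.argmax())
    print(f"Closest fact-check (score {float(scores[best]):.2f}): {fact_checks[best]}")

In a real pipeline, matches above a calibrated similarity threshold would be surfaced to fact-checkers rather than decided automatically.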

The results will be integrated into a demonstrator that will support practitioners, such as journalists and fact-checkers, in investigating and identifying disinformation.


CYNTRA – Towards an Effective Multi-Label Classification and Model Auditing Ecosystem for Combatting Textual Online Hate Speech

Online hate speech poses growing challenges for law enforcement agencies and hate speech reporting centers. The integration of the revised EU voluntary code of conduct into the Digital Services Act framework strengthens these organizations' role as trusted flaggers. Simultaneously, major social media platforms are reducing moderation capacities – case volumes are increasing while manual processing reaches its limits.

CYNTRA develops a comprehensive ecosystem for analyzing textual online hate speech through AI-based multi-label classification. Conducting empirical case studies in Germany and the United Kingdom, the project compares Romano-Germanic and Anglo-American approaches to hate speech classification. The ecosystem comprises enhanced datasets with expert annotations, adjustable classification models including large language model prompting, and a user-centered dashboard for case analysis and system auditing. The objective is to prioritize reports more efficiently and continuously adapt AI models to evolving legal and linguistic requirements.
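
A minimal sketch of the multi-label setting is shown below, using a toy scikit-learn pipeline with hypothetical labels rather than CYNTRA's expert annotations or legal taxonomy. The key point is that each report can receive several labels at once, one binary decision per category:

    # Toy multi-label classification: one binary classifier per label, so a
    # report can carry several categories simultaneously. Labels and training
    # posts are hypothetical; the real system uses expert-annotated data and
    # stronger models, including LLM prompting.
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.linear_model import LogisticRegression
    from sklearn.multioutput import MultiOutputClassifier
    from sklearn.pipeline import make_pipeline
    from sklearn.preprocessing import MultiLabelBinarizer

    posts = [
        "You are a worthless idiot.",
        "Someone should burn their houses down.",
        "You idiots should all be attacked.",
        "Lovely weather today.",
    ]
    labels = [["insult"], ["incitement"], ["insult", "incitement"], []]

    mlb = MultiLabelBinarizer()
    y = mlb.fit_transform(labels)  # one indicator column per label

    clf = make_pipeline(
        TfidfVectorizer(ngram_range=(1, 2)),
        MultiOutputClassifier(LogisticRegression(max_iter=1000)),
    )
    clf.fit(posts, y)

    pred = clf.predict(["What an idiot, someone attack him."])
    print(mlb.inverse_transform(pred))  # e.g. [('incitement', 'insult')]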


DREAM: Deepfake REcognition and Artificial Media

The DREAM project studies methods for recognizing and identifying synthesized or manipulated media content generated with artificial intelligence. A special focus is placed on detecting manipulations across media types, namely images, videos, and audio, that are created with the intent to impersonate. So-called deepfakes can automatically replace faces appearing in images or videos with the face of any person using deep learning. Images can be generated by text-to-image synthesis methods such as DALL-E, Stable Diffusion, or Midjourney. For videos, face-swapping or facial-reenactment techniques such as "lip-sync attacks" can be used. For audio data, the voice of a specific target person is imitated, e.g. through voice conversion or text-to-speech synthesis, so that words can be put into that person's mouth. To gain a better understanding of multimodal manipulations, the project also involves generating these types of fake media.
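
As a deliberately simplified illustration of one classic detection cue, not DREAM's actual method, the snippet below computes a frequency-domain statistic: many generators leave unusual high-frequency patterns that such statistics can surface before a learned classifier makes the final call. The file name is hypothetical.

    # Naive frequency-domain cue for synthetic-image screening (illustrative
    # only; real detectors learn such cues rather than thresholding them).
    import numpy as np
    from PIL import Image

    def high_freq_energy_ratio(path: str) -> float:
        """Share of spectral energy outside the low-frequency band of an image."""
        img = np.asarray(Image.open(path).convert("L"), dtype=np.float64)
        spectrum = np.abs(np.fft.fftshift(np.fft.fft2(img))) ** 2
        h, w = spectrum.shape
        cy, cx = h // 2, w // 2
        r = min(h, w) // 8  # "low frequency" = central square after fftshift
        low = spectrum[cy - r:cy + r, cx - r:cx + r].sum()
        return float(1.0 - low / spectrum.sum())

    # In practice such statistics feed a trained classifier; a naive rule
    # might compare the score against those of known-real reference images.
    score = high_freq_energy_ratio("suspect_image.png")  # hypothetical file
    print(f"High-frequency energy ratio: {score:.3f}")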


TRACE: Tracing and Recognizing AI-generated Content and Evidence

TRACE is a research project on advanced forensics for generative AI and deepfakes. As synthetic audio, video, images and text become increasingly realistic, simply detecting that something is fake is no longer enough. In real investigations such as political deepfakes, financial fraud with fake executives in video calls, or AI-generated robocalls in elections, investigators need to know how the content was created, which tools were used and what source material may have been involved.

TRACE focuses on model provenance and AI fingerprinting. It looks for subtle, consistent traces left by generative models to identify which systems or methods produced a given piece of media. The project aims to profile the generative process itself, uncover model-specific artifacts, infer manipulation techniques and generate metadata that can be used as robust forensic evidence.
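
The fingerprinting idea can be sketched in the spirit of classic sensor-noise attribution; the snippet below is an illustration under simplifying assumptions, not TRACE's pipeline. A noise residual is estimated per image, residuals of media known to come from one generator are averaged into a reference fingerprint, and new media is attributed to the fingerprint with the highest correlation.

    # Fingerprint-style attribution sketch: residual extraction, fingerprint
    # averaging, and normalized correlation (all simplifying assumptions).
    import numpy as np
    from scipy.ndimage import gaussian_filter

    def residual(img: np.ndarray) -> np.ndarray:
        """Noise residual: image minus a smoothed version of itself."""
        return img - gaussian_filter(img, sigma=1.5)

    def fingerprint(images: list[np.ndarray]) -> np.ndarray:
        """Average residual over media known to come from one generator."""
        return np.mean([residual(i) for i in images], axis=0)

    def correlation(a: np.ndarray, b: np.ndarray) -> float:
        a, b = a - a.mean(), b - b.mean()
        return float((a * b).sum() / (np.linalg.norm(a) * np.linalg.norm(b)))

    # Hypothetical usage with per-generator reference sets:
    # fingerprints = {name: fingerprint(imgs) for name, imgs in refs.items()}
    # best = max(fingerprints,
    #            key=lambda n: correlation(residual(new_img), fingerprints[n]))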

Within the REVISE program and alongside ATHENE projects like DREAM, TRACE adds an explanatory layer to classic deepfake detection. It bridges technical forensics and legal requirements, supports fact-checking and law enforcement, and improves the analysis of disinformation patterns and early warning systems. The goal is to strengthen media security and digital trust by making AI-generated content more transparent, traceable and accountable.


TXAITD: Trustworthy and Explainable AI-generated Text Detection

The unprecedented capabilities of recent large language models such as ChatGPT and Bard have led to their increased use as writing assistants. However, since these models produce text that is often very difficult to distinguish from human-written content, they are also increasingly used for malicious purposes, such as automatically writing assignments, composing AI-generated scientific papers, spreading fake news, and conducting social engineering attacks. To combat these issues, this project focuses on developing Trustworthy and Explainable AI-generated Text Detection (TXAITD) technology. TXAITD aims to identify passages created by language models, providing explanations and fine-grained localization of AI usage within texts. This approach helps differentiate between benign uses of AI as a writing assistant and malicious activities. Unlike previous systems, TXAITD makes the detection process explainable and trustworthy, empowering human users and decision-makers to make informed judgments about digital content and thereby enhancing societal and personal security. Ultimately, the project contributes to the secure digital transformation and to the regulation of AI in text composition.
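
One common detection signal, sketched below using an off-the-shelf GPT-2 reference model from the transformers library, scores each sentence by its perplexity: unusually low perplexity is a cue for machine-generated text, and per-sentence scores provide the fine-grained localization described above. This is an illustrative cue, not TXAITD's full method.

    # Sentence-level perplexity under a reference language model as one
    # (insufficient on its own) signal for AI-generated text.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("gpt2")
    model = AutoModelForCausalLM.from_pretrained("gpt2")
    model.eval()

    def sentence_perplexity(sentence: str) -> float:
        ids = tokenizer(sentence, return_tensors="pt").input_ids
        with torch.no_grad():
            loss = model(ids, labels=ids).loss  # mean token cross-entropy
        return float(torch.exp(loss))

    document = [
        "The results, presented in Table 3, align with prior findings.",
        "In conclusion, the aforementioned methodology demonstrates efficacy.",
    ]
    for s in document:
        print(f"{sentence_perplexity(s):8.1f}  {s}")
    # Raw scores are not proof; a real detector calibrates them and, as the
    # project emphasizes, explains why a passage was flagged.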