Analytics
Textgain was founded in 2015 as a spin-off of the University of Antwerp (Belgium). We specialize in the development of Artificial Intelligence that automatically detects and monitors harmful online societal trends and tensions, such as hate speech and disinformation. In 2016, Textgain gained significant attention for its efforts to detect jihadist propaganda on social media, and it has since expanded its software stack to detect online signs of radicalization in all its forms, including extreme-left and extreme-right rhetoric. In 2021, Textgain became the coordinator of the European Observatory of Online Hate, an initiative to monitor online hate speech across the European Union.
Textgain built a GDPR-compliant social media monitoring pipeline that filters posts against a list of keywords covering the most potentially polarising topics, producing a database for further analysis. On top of this, we built a bot detection algorithm customised for finding harmful bot content on the mainstream social media platform TikTok. To distinguish harmful bots from regular bots on TikTok, we applied our customised transformer-based toxicity detection algorithm to all texts in the database. The resulting network displays the hashtags appended to potential bot content on TikTok that is also marked as toxic.
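The first stage of such a pipeline, keyword filtering, can be sketched as follows. This is a minimal illustration only: the keyword list and post texts here are invented, as the actual list of polarising topics is not published.

```python
import re

# Hypothetical keyword list; the real polarising-topic list is not public.
KEYWORDS = ["election fraud", "great replacement", "vaccine hoax"]

# One pre-compiled, case-insensitive pattern matching any monitored keyword.
pattern = re.compile("|".join(re.escape(k) for k in KEYWORDS), re.IGNORECASE)

def matches_keywords(text: str) -> bool:
    """Return True if a post mentions any monitored topic."""
    return pattern.search(text) is not None

# Posts that match the keyword list are kept for further analysis.
posts = [
    "Nothing to see here, just cooking videos",
    "They are hiding the ELECTION FRAUD from you!",
]
flagged = [p for p in posts if matches_keywords(p)]
```

Matching posts would then flow into the database on which the bot detection and toxicity models run.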
Glossary
-
The circles represent hashtags, and the lines represent connections between hashtags that
are used by the same bot(s). Hashtag circles that often appear together attract each other,
and vice versa, resulting in a grouping of those hashtags. The larger a circle, the more often
the hashtag appears in the data. The colour of a circle represents its community, which
is calculated using the Louvain method ( https://en.wikipedia.org/wiki/Louvain_method ). A
community is a hub in which the circles interact significantly more with each other than with
circles outside the community. The network as a whole displays bot message interaction
among TikTok posts, whereby the hashtags represent bot-targeted topics.
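The construction described above can be sketched with NetworkX: hashtags become nodes sized by frequency, co-use in the same comment becomes a weighted edge, and Louvain community detection produces the colour groups. The comment data below is a toy example, not real monitoring output.

```python
from collections import Counter
from itertools import combinations

import networkx as nx
from networkx.algorithms.community import louvain_communities

# Toy data: hashtags attached to individual suspected-bot comments.
comments = [
    ["#news", "#politics", "#eu"],
    ["#news", "#politics"],
    ["#fyp", "#dance"],
    ["#fyp", "#dance", "#music"],
]

G = nx.Graph()

# Circle size ~ how often the hashtag appears in the data.
freq = Counter(tag for tags in comments for tag in tags)
for tag, count in freq.items():
    G.add_node(tag, size=count)

# Edge ~ two hashtags used by the same bot comment; weight counts co-uses.
for tags in comments:
    for a, b in combinations(sorted(set(tags)), 2):
        w = G.edges[a, b]["weight"] + 1 if G.has_edge(a, b) else 1
        G.add_edge(a, b, weight=w)

# Louvain communities: groups of hashtags that interact more with each
# other than with the rest of the network (the circle colours).
communities = louvain_communities(G, weight="weight", seed=42)
```

A force-directed layout of `G` then reproduces the attraction/repulsion behaviour described in the glossary: heavily co-used hashtags cluster, and communities separate visually.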
-
Textgain, as the technological partner of IMSyPP, is tackling hate speech in a multidisciplinary
fashion, combining machine learning, computational social science and linguistic approaches
to support a data-driven approach to hate speech regulation, prevention and
awareness-raising. The goal of this initiative is the automated detection and sustainable
monitoring of hate speech. To this end, we developed near real-time hate speech detection
models tuned to language, culture and legislation, taking the context of the message
into account.
-
● 0 - APPROPRIATE: no target
● 1 - INAPPROPRIATE: contains terms that are obscene or vulgar, but the text is not
directed at any person specifically; has no target
● 2 - OFFENSIVE: includes offensive generalization, contempt, dehumanization, or
indirect offensive remarks
● 3 - VIOLENT: the author threatens, indulges in, desires or calls for physical violence
against a target; this also includes calling for, denying or glorifying war crimes and
crimes against humanity
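The 0-3 scale above can be encoded as a simple lookup. The `is_targeted` helper below is an assumption of ours, reflecting only the glossary's distinction that classes 0 and 1 have no target while 2 and 3 do; it is not part of the published IMSyPP scheme.

```python
# The four-point scale from the glossary above.
LABELS = {
    0: "APPROPRIATE",
    1: "INAPPROPRIATE",
    2: "OFFENSIVE",
    3: "VIOLENT",
}

def is_targeted(class_id: int) -> bool:
    """Hypothetical helper: classes 2 and 3 are directed at a target,
    while 0 and 1 are not (per the glossary's 'no target' notes)."""
    return class_id >= 2
```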
Words used in bot comments
In the word cloud below you can see the words used in suspected bot comments classified as toxic over the past month.
Hashtags used in bot comments
In the word cloud below you can see the hashtags used in suspected bot comments classified as toxic over the past 3 months.
Suspected bot comments
Number of suspected bot comments per month over the last 6 months.
Past month
Number of suspected bot comments per classification over the past month.
Past 3 months
Number of suspected bot comments per classification over the past 3 months.
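The monthly and per-classification counts behind these charts amount to simple aggregations. A minimal sketch, assuming a log of (date, class) pairs; the data here is invented for illustration:

```python
from collections import Counter
from datetime import date

# Toy log of suspected bot comments: (posting date, class on the 0-3 scale).
comment_log = [
    (date(2024, 1, 5), 2),
    (date(2024, 1, 20), 3),
    (date(2024, 2, 3), 2),
    (date(2024, 2, 10), 0),
]

# Comments per month (the 6-month bar chart) ...
per_month = Counter(d.strftime("%Y-%m") for d, _ in comment_log)

# ... and comments per classification (the past-month / past-3-months breakdowns).
per_class = Counter(c for _, c in comment_log)
```

Restricting `comment_log` to the desired window (past month or past 3 months) before counting yields each of the chart variants above.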