Abstract
This work discusses and contains content that may be offensive or unsettling. Hateful communication has always been part of human interaction, even before the advent of social media. Today, offensive content spreads faster and more widely through digital communication channels. To help improve the regulation of hate speech, we introduce ProperBERT, a fine-tuned BERT model for detecting hate speech and offensive language in English. To ensure the portability of our model, five datasets from the literature were combined to train ProperBERT. The pooled dataset contains racist, homophobic, misogynistic, and generally offensive statements. Because these statements vary, mainly in the target of the hate and in how overt it is, a sufficiently robust model could be trained. ProperBERT remains stable on datasets that were not used for training and is efficient to use thanks to its compact size. Portability tests on datasets not used for fine-tuning show that fine-tuning on large-scale, varied data increases model portability.
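The pooling and portability setup described in the abstract can be illustrated with a minimal sketch. It assumes that each source corpus uses its own label names and that all of them are mapped onto one shared binary scheme; the abstract does not specify the label scheme, so this is an assumption. The corpus names, label names, texts, and the `pool_datasets` helper are all hypothetical placeholders, not the authors' code or data.

```python
from typing import Dict, List, Tuple

# (text, unified label). Assumption: 1 = hateful/offensive, 0 = neither.
Example = Tuple[str, int]


def pool_datasets(
    datasets: Dict[str, List[Tuple[str, str]]],
    label_maps: Dict[str, Dict[str, int]],
    held_out: str,
) -> Tuple[List[Example], List[Example]]:
    """Map every corpus onto one label scheme, then split by corpus.

    All corpora except `held_out` form the pooled training set; the
    held-out corpus is kept aside for a portability test.
    """
    train: List[Example] = []
    test: List[Example] = []
    for name, rows in datasets.items():
        mapping = label_maps[name]
        unified = [(text, mapping[label]) for text, label in rows]
        # The held-out corpus never enters the fine-tuning data.
        (test if name == held_out else train).extend(unified)
    return train, test


# Hypothetical toy corpora with placeholder texts and made-up label names.
datasets = {
    "corpus_a": [("text a1", "hate"), ("text a2", "none")],
    "corpus_b": [("text b1", "offensive"), ("text b2", "normal")],
    "corpus_c": [("text c1", "abusive"), ("text c2", "clean")],
}
label_maps = {
    "corpus_a": {"hate": 1, "none": 0},
    "corpus_b": {"offensive": 1, "normal": 0},
    "corpus_c": {"abusive": 1, "clean": 0},
}

train, test = pool_datasets(datasets, label_maps, held_out="corpus_c")
```

Splitting by corpus rather than by random sample is what makes the evaluation a portability test: the held-out examples come from a source the model has never seen, not just from unseen rows of a familiar source.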
| Original language | English |
|---|---|
| Title | International Conference on Electrical, Computer, Communications and Mechatronics Engineering, ICECCME 2022 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 1-6 |
| Number of pages | 6 |
| ISBN (electronic) | 9781665470957 |
| ISBN (print) | 978-1-6654-7096-4 |
| Publication status | Published - 2022 |
| Event | 2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering, ICECCME 2022 - Male, Maldives. Duration: 16 Nov 2022 → 18 Nov 2022 |
Publication series
| Name | International Conference on Electrical, Computer, Communications and Mechatronics Engineering, ICECCME 2022 |
|---|---|
Conference
| Conference | 2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering, ICECCME 2022 |
|---|---|
| Country/Territory | Maldives |
| City | Male |
| Period | 16 Nov 2022 → 18 Nov 2022 |
UN SDGs
This output contributes to the following Sustainable Development Goal(s):
- SDG 7: Affordable and Clean Energy
Keywords
- Training
- Mechatronics
- Social networking (online)
- Computational modeling
- Hate speech
- Speech recognition
- Digital communication
Fingerprint
Explore the research topics of "ProperBERT - Proactive Recognition of Offensive Phrasing for Effective Regulation". Together they form a unique fingerprint.