Can artificial intelligence encourage good behavior among internet users? – The Jakarta Post – Jakarta Post

Hostile and hateful remarks are thick on the ground on social networks in spite of persistent efforts by Facebook, Twitter, Reddit and YouTube to tone them down. Now researchers at the OpenWeb platform have turned to artificial intelligence to moderate internet users’ comments before they are even posted.

The study conducted by OpenWeb and Perspective API analyzed 400,000 comments that some 50,000 users were preparing to post on sites like AOL, Salon, Newsweek, RT and Sky Sports.

Some of these users received a feedback message or nudge from a machine learning algorithm to the effect that the text they were preparing to post might be insulting, or against the rules for the forum they were using. Instead of rejecting comments it found to be suspect, the moderation algorithm then invited their authors to reformulate what they had written.

“Let’s keep the conversation civil. Please remove any inappropriate language from your comment,” was a message prompt or “Some members of the community may find your comment offensive. Try again?”

In response to this kind of feedback, a third of internet users (34 percent) immediately modified their comments, while 36% went ahead and posted their comments anyway, taking the risk that they might be rejected by the moderating algorithm. Even more surprisingly, some users made modifications that did not necessarily make their comments kinder or less hostile.

Read also: Indonesia ranks 57th in Inclusive Internet Index, among the lowest in the world

Using tricks to get around the algorithm

While close to 30 percent of users opted to accept the feedback message and delete potentially offensive text from their comments, more than a quarter (25.8 percent) attempted to dupe the moderating algorithm.

READ  Instagram DOWN: Server status latest, Insta not working and news feed issues - Express

Deliberate spelling errors and adding spaces between letters were just two of the tricks they used to modify the form of their comments while leaving their content unchanged.

The 400,000 comments analyzed in the study are, however, a mere drop in the ocean when compared to the millions that are posted daily on the internet, some of which carry offensive and insulting language.

Faced with this situation, tech giants are boosting their efforts to combat online hate more effectively. It is a fight in which artificial intelligence can make a useful but, for now at least, imperfect contribution.

Your premium period will expire in 0 day(s)

close x

Subscribe to get unlimited access Get 50% off now



Please enter your comment!
Please enter your name here