Detect bad words, swear words and profanity in a given text.
Currently works with a mostly English database which includes "leetspeak".
The filter uses natural language processing (NLP) to decode the content into logical words ignoring punctuation, case, formatting, etc.
We also apply word transformations to detect obfuscation of bad words like repeating characters, spurious whitespace and special characters.
As well as detection and extraction of bad words you can also use this API to censor bad words from the supplied text.
The content to scan. This can be either a URL to load from, a file upload or an HTML content string
The character to use to censor out the bad words found
Which catalog of bad words to use, we currently maintain two bad word catalogs:
strict - the largest database of bad words which includes profanity, obscenity, sexual, rude, cuss, dirty, swear and objectionable words and phrases. This catalog is suitable for environments of all ages including educational or children's content
obscene - like the strict catalog but does not include any mild profanities, idiomatic phrases or words which are considered formal terminology. This catalog is suitable for adult environments where certain types of bad words are considered OK
Does the text contain bad words
Total number of bad words detected
An array of the bad words found
The censored content (only set if censor-character has been set)