> it excludes letters used for profanity That doesn't seem possible. How would t...

no_wizard · on Nov 25, 2023

This is the implementation: https://github.com/CyberAP/nanoid-dictionary

We use it in a highly internationalized product spanning multiple languages and haven’t yet ran into a complaint or value on audit that would constitute something offense in any language per our intl content teams anyway.

That isn’t to say it’s 100% (and simply enough we don’t audit every single URL) but I suspect we would have gotten at least a user heads up by now

Never the less we are moving our approach to uuids that get base32 encoded for some of our use case for this. They’re easier to work for us in many scenarios

Silasdev · on Nov 25, 2023

It's particularly funny because their example docs for .NET outputs "B4aajs", which to any Swedish l33t speaking individual, would read "Bajs", which means "shit"

owyn · on Nov 26, 2023

Somewhere there's a database for every bad word and every bad typo in every language and that one just got added.

Sharlin · on Nov 25, 2023

Omit vowels and you're 90% of the way there; omit the vowel-looking digits 0,1,3,4 and you're probably >99% of the way there.

gberger · on Nov 25, 2023

Sharlin · on Nov 25, 2023

Which is, evidently, why nanoids also excludes x and X, as well as v and V (fvck).

cdelsolar · on Nov 27, 2023

njharman · on Nov 25, 2023

> That doesn't seem possible. How would that work?

agree; b00b, DlCK, cntfcker

But I suppose, if user doesn't get to craft input, the collision space of converted numerical ids and words like above is sufficiently small to be ignorable.

Sharlin · on Nov 25, 2023

Besides vowels, nanoid excludes 0, 1, 3, 4, 5, I, l, x, X, v, V, and other lookalikes, so the chances of generating something naughty in any language are close to zero.

jl6 · on Nov 26, 2023

Humans have a high capacity for spotting rudeness. Nanoid’s nolookalikesSafe alphabet would allow blwjb69FKmyD7CK.

(Sorry)

Two4 · on Nov 26, 2023

Buy me drink first, jeez

livrem · on Nov 25, 2023

Looks like the dictionaries used are from this file?

https://registry.npmjs.org/naughty-words/-/naughty-words-1.2...

From a quick look, the lists are pretty short, except for the one with English words that at least have some 404 words, but I can imagine there are far more bad words that you want to avoid than just those?

ape4 · on Nov 25, 2023

Here's the C++ of the sqid blocked words https://github.com/sqids/sqids-cpp/blob/main/include/sqids/b...