Hmmm.. The more I think about this the more any font kerning is likely a major l...

worewood · 2025-12-24T01:15:53 1766538953

There was a recent vulnerability, where researchers were able to extract information from an encrypted chat session from an LLM, by analyzing packet size/timings of the underlying SSL connection. A classic side-channel attack. Seems possible to draw a parallel between the two.

dylan604 · 2025-12-24T05:36:33 1766554593

> the more any font kerning is likely a major leak for redaction

Now I want a font that randomly adjusts the kerning automagically to be used by people in standard word processors not some graphics app. In this way, every time the same word appears in the document, the kerning is different between each one.

chews · 2025-12-24T06:44:33 1766558673

My autism wants that idea straight into a dumpster fire.

dylan604 · 2025-12-24T16:20:19 1766593219

not really sure what this means.

most people cannot detect differences in kerning, and must be extreme adjustments to get people to notice. even then, the words would need to be aligned above/below each other for people to see the differences. however, a computer program analyzing the size of a bounding box would notice single pixel differences. so randomly adjusting the kearning per word by pixels between each letter would go unnoticed by the vast majority of readers, but could play absolute havoc with algos trying to decipher possible word combos based on bounding box size.

mlissner · 2025-12-24T00:53:44 1766537624

Really depends on the length and predictability of the redaction, but yes. If it's short and contextually it's only likely to be either "yes" or "no", you've got it. If it's longer and could contain an unknown person's name along with some other words, well, that's harder.

jmward01 · 2025-12-24T02:38:39 1766543919

I feel like this creates a hash value and the real question is how unique of a value does it represent and how easy it is to narrow it down given throwing a dictionary at it. Similarly, unknown names could likely be teased out like a one-time pad. If they appear in multiple sentences then their randomness quickly repeats and becomes something that potentially could be isolated from the rest of the words around them. This would probably be a fun problem for a cryptography class to work on.

skykooler · 2025-12-24T14:54:45 1766588085

If so, then finding the redacted string would be similar to trying to brute-force a hash (though presumably slower, since text layout algorithms are probably more complex than a single hash invocation).

IshKebab · 2025-12-24T16:20:15 1766593215

Unlikely to be possible except for the smallest redactions, like if you have a single name redacted and a list of candidates. But I think kerning wouldn't help you much more than just knowing the rough length anyway.

ComplexSystems · 2025-12-25T15:57:00 1766678220

Kerning and perplexity together could probably solve quite a few of these.