Chris Ellis: "@azonenberg@ioc.exchange @whitequark@treehouse.sy…"

@whitequark Most applications that use UUIDs these days tend to use version 7 UUIDs, which use milliseconds since the Unix epoch as the most significant bits. This embeds the creation timestamp in the ID and allows for sorting, while also adding in sufficient randomness so they're not incremental and multiple systems can generate them with low probability of collision.
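For readers who have not seen the layout, here is a minimal Python sketch of a version 7 UUID as described above: a 48-bit millisecond Unix timestamp in the most significant bits, then the version and variant markers, with the remaining bits random. The helper name `uuid7_sketch` is hypothetical and the bits are set by hand, so treat this as an illustration rather than a library implementation.

```python
import os
import time
import uuid


def uuid7_sketch(unix_ms=None):
    """Build a version 7 style UUID: 48-bit ms timestamp, then random bits."""
    if unix_ms is None:
        unix_ms = time.time_ns() // 1_000_000
    rand = int.from_bytes(os.urandom(10), "big")  # 80 random bits, 74 are used
    rand_a = (rand >> 62) & 0xFFF                 # 12 bits after the version nibble
    rand_b = rand & ((1 << 62) - 1)               # 62 bits after the variant bits
    value = (unix_ms & ((1 << 48) - 1)) << 80     # timestamp in the top 48 bits
    value |= 0x7 << 76                            # version nibble = 7
    value |= rand_a << 64
    value |= 0b10 << 62                           # variant bits
    value |= rand_b
    return uuid.UUID(int=value)


earlier = uuid7_sketch(1_700_000_000_000)
later = uuid7_sketch(1_700_000_000_001)
print(earlier, later, earlier < later)
```

Because the timestamp leads, IDs generated in different milliseconds sort by creation time, both as integers and as their canonical strings. Within the same millisecond the order is decided by the random bits.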

**✧✦Catherine✦✧** @whitequark@treehouse.systems · May 25, 2026, 21:06

@ramsey oh, I didn't know this!

**Andrew Zonenberg** @azonenberg@ioc.exchange · May 25, 2026, 21:09

@whitequark @ramsey I've only ever seen version 4 (unstructured random) in production that I can recall

**Andrew Zonenberg** @azonenberg@ioc.exchange · May 25, 2026, 21:11

@whitequark @ramsey (this is the first I've ever heard of v7)

**Chris Ellis** @intrbiz@bergamot.social · 2026-05-25T21:20:19Z

@azonenberg @whitequark @ramsey

V7 is fairly new, standardised around 2024. They've got a bit more adoption in databases over the last year.

**Emelia/Emi** @becomethewaifu@tech.lgbt · May 25, 2026, 21:23

@intrbiz @azonenberg @whitequark @ramsey Yeah, postgres 18 added support for generating them IIRC, though UUIDs are inherently "mostly backwards compatible" unless you're trying to parse them for some godforsaken reason, so older versions support it just fine if the client generates them.

They make for much happier indexes and sharding vs the ones with leading-random, because most workloads don't have truly random access patterns...

**Chris Ellis** @intrbiz@bergamot.social · May 25, 2026, 21:26

@becomethewaifu

Indeed, I gave a talk at POSETTE last year about encoding information into UUIDs and some of the index issues.

IMHO you can have more fun than just encoding generation time into them.

**Ben Ramsey** @ramsey@phpc.social · May 25, 2026, 21:45

@intrbiz @becomethewaifu Version 8 is probably more suited to those use-cases, though.

**Chris Ellis** @intrbiz@bergamot.social · May 25, 2026, 21:50

@ramsey

Indeed, my code was generating UUIDs marked as V8. The version is just a nibble that's been standardised. And it's handy to have a standardised version number for custom generation schemes.
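To make the "version is just a nibble" point concrete, here is a small Python sketch that reads the version nibble and packs caller-defined fields into a UUID marked as version 8. The function names and the 48/12/62-bit field split are assumptions for illustration; this is not the scheme from the talk.

```python
import uuid


def version_nibble(u):
    """Return the 4-bit version field (bits 76..79 of the 128-bit value)."""
    return (u.int >> 76) & 0xF


def uuid8_sketch(custom_a, custom_b, custom_c):
    """Pack three caller-defined fields (48, 12 and 62 bits) into a v8 UUID."""
    value = (custom_a & ((1 << 48) - 1)) << 80
    value |= 0x8 << 76                      # version nibble = 8 (custom layout)
    value |= (custom_b & 0xFFF) << 64
    value |= 0b10 << 62                     # variant bits
    value |= custom_c & ((1 << 62) - 1)
    return uuid.UUID(int=value)


example = uuid8_sketch(0x0123456789AB, 0xCDE, 0x1234)
print(example, version_nibble(example))
# 01234567-89ab-8cde-8000-000000001234 8
```

Everything outside the version nibble and the two variant bits is free for the application to define, which is what makes the version 8 marker useful for custom schemes.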

**Keith Wansbrough** @kw217@mathstodon.xyz · May 25, 2026, 22:23

@intrbiz @ramsey time-based (type 1 back in the day, now type 7) is really useful for debugging; the creation date and time can be a really strong hint.

**Ben Ramsey** @ramsey@phpc.social · May 25, 2026, 22:26

@kw217 @intrbiz Version 1 still exists, but it’s based on the weird value of 100-nanosecond intervals since the Gregorian epoch in 1582. Version 7 is based on milliseconds since the Unix epoch.
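As a rough illustration of the version 1 timestamp, the Python sketch below converts the 60-bit count of 100-nanosecond intervals since 15 October 1582 into a UTC datetime. The helper name `v1_creation_time` is hypothetical, and the sample UUID is just a fixed version 1 value chosen for the demonstration.

```python
import uuid
from datetime import datetime, timedelta, timezone

# Start of the Gregorian calendar, the epoch used by version 1 timestamps.
GREGORIAN_EPOCH = datetime(1582, 10, 15, tzinfo=timezone.utc)


def v1_creation_time(u):
    """Decode the 100-ns tick count of a version 1 UUID into a datetime."""
    if u.version != 1:
        raise ValueError("not a version 1 UUID")
    # u.time is the 60-bit tick count; 10 ticks make one microsecond.
    return GREGORIAN_EPOCH + timedelta(microseconds=u.time // 10)


sample = uuid.UUID("c232ab00-9414-11ec-b3c8-9f6bdeced846")
print(v1_creation_time(sample))  # 2022-02-22 19:22:22+00:00
```

This is also the debugging trick mentioned earlier in the thread: given only the ID, the creation time falls straight out.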

**Ben Ramsey** @ramsey@phpc.social · May 25, 2026, 22:30

@kw217 @intrbiz One reason not to use version 1 is that it leaks details about the system (i.e., the MAC address). Another reason is that the values aren’t sortable. Version 6 was introduced to solve this. It’s also based on 100-nanosecond intervals since the Gregorian epoch, but it’s sortable and uses random bytes following the timestamp, rather than the MAC address.

But, for most purposes, version 7 is the right solution, unless you need to create UUIDs for dates earlier than 1970.
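A sketch of the version 1 to version 6 relationship, under the assumption that the only changes are the field order and the node value: version 1 stores the low 32 bits of the timestamp first, which is why it does not sort, while version 6 stores the same 60-bit timestamp most significant bits first. The helper `v1_to_v6` is hypothetical and replaces the node with random bits unless one is passed in.

```python
import secrets
import uuid


def v1_to_v6(u, node=None):
    """Reorder a version 1 UUID's timestamp into the sortable v6 layout."""
    if u.version != 1:
        raise ValueError("not a version 1 UUID")
    if node is None:
        node = secrets.randbits(48)         # random node instead of a MAC address
    ts = u.time                             # 60-bit count of 100-ns intervals
    value = (ts >> 28) << 96                # top 32 bits of the timestamp
    value |= ((ts >> 12) & 0xFFFF) << 80    # middle 16 bits
    value |= 0x6 << 76                      # version nibble = 6
    value |= (ts & 0xFFF) << 64             # low 12 bits
    value |= u.int & (0xFFFF << 48)         # keep variant bits and clock sequence
    value |= node & ((1 << 48) - 1)
    return uuid.UUID(int=value)


v1 = uuid.UUID("c232ab00-9414-11ec-b3c8-9f6bdeced846")
print(v1_to_v6(v1, node=v1.node))  # 1ec9414c-232a-6b00-b3c8-9f6bdeced846
print(hex(v1.node))                # the 48-bit node field that v1 exposes
```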

**Irenes (many)** @ireneista@irenes.space · May 25, 2026, 22:33

@ramsey @kw217 @intrbiz as a privacy person we should say that individuals and activist groups probably want to avoid leaking timestamps, especially fine-grained timestamps, pretty much everywhere and always, although the attack models are highly indirect and people don't usually know what they're protecting until they've lost it

it's fine for corporate use as long as it can't be tied to an individual

**Keith Wansbrough** @kw217@mathstodon.xyz · May 25, 2026, 23:10

@ireneista @ramsey @intrbiz entirely fair - this was in a corporate context; in activism circles (hmm, in most places now) one should definitely think carefully about privacy properties.

**Chris Ellis** @intrbiz@bergamot.social · May 25, 2026, 23:18

@ireneista @ramsey @kw217

Valid concern in some domains for sure. I tend to keep the time buckets pretty big. There are block-based approaches which remove the time-related issues but still reduce issues with things like indexes.
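One way to read "big time buckets" is sketched below in Python: the sortable prefix holds a coarse bucket number (one day here) rather than a millisecond timestamp, and the rest is random. The bucket size, field layout and function names are all assumptions for illustration, not the scheme described in the post.

```python
import secrets
import time
import uuid

BUCKET_SECONDS = 24 * 60 * 60  # hypothetical bucket size: one day


def bucketed_uuid8(unix_seconds=None, bucket_seconds=BUCKET_SECONDS):
    """Version 8 style UUID whose prefix is a coarse time bucket number."""
    if unix_seconds is None:
        unix_seconds = int(time.time())
    bucket = unix_seconds // bucket_seconds
    value = (bucket & ((1 << 48) - 1)) << 80    # coarse bucket in the top 48 bits
    value |= 0x8 << 76                          # version nibble = 8
    value |= secrets.randbits(12) << 64
    value |= 0b10 << 62                         # variant bits
    value |= secrets.randbits(62)
    return uuid.UUID(int=value)


def bucket_of(u):
    """Recover the bucket number, which is all the time information in the ID."""
    return u.int >> 80


same_day_a = bucketed_uuid8(1_700_000_000)
same_day_b = bucketed_uuid8(1_700_000_000 + 3600)
next_day = bucketed_uuid8(1_700_000_000 + 86_400)
print(bucket_of(same_day_a), bucket_of(same_day_b), bucket_of(next_day))
# 19675 19675 19676
```

IDs from the same bucket still cluster together in an index, but an observer learns only the day, not the millisecond.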

**Irenes (many)** @ireneista@irenes.space · May 25, 2026, 23:22

@intrbiz @ramsey @kw217 yeah, fascinatingly, if you allocate IDs at a global scale via sharding, the shards wind up forming a weak proxy for geographic location

this stuff is really hard to get right

**Vincent Sparks** @AVincentInSpace@furry.engineer · May 26, 2026, 02:35

@ireneista @ramsey @kw217 @intrbiz huh? what does that reveal, other than a time that your computer was on and (maybe, if your UUID generation code is deeply misconfigured) your timezone?

**Irenes (many)** @ireneista@irenes.space · May 26, 2026, 02:38

@AVincentInSpace @ramsey @kw217 @intrbiz well, exactly that. whether that's a problem depends on what you use it for.

**Ben Ramsey** @ramsey@phpc.social · May 26, 2026, 02:56

@ireneista @AVincentInSpace @kw217 @intrbiz Right. Let’s say that someone is trying to find out what you were doing at a certain time. If they find an ID that was generated at a specific time and owned by your account, then they can deduce a general idea of what you might have been doing at that time (i.e., you were probably using a certain app during that time).
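To show how little effort the deduction takes, here is a short Python sketch that recovers the creation time from a version 7 UUID by reading its top 48 bits. The helper name `v7_creation_time` is hypothetical and the sample ID is a fixed value chosen for the demonstration.

```python
import uuid
from datetime import datetime, timedelta, timezone

UNIX_EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def v7_creation_time(u):
    """Read the 48-bit millisecond timestamp from a version 7 UUID."""
    if (u.int >> 76) & 0xF != 7:
        raise ValueError("not a version 7 UUID")
    return UNIX_EPOCH + timedelta(milliseconds=u.int >> 80)


sample = uuid.UUID("017f22e2-79b0-7cc3-98c4-dc0c0c07398f")
print(v7_creation_time(sample))  # 2022-02-22 19:22:22+00:00
```

Anyone who can see the ID can run the same two lines, so the timestamp should be treated as public wherever the ID is.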

**Keith Wansbrough** @kw217@mathstodon.xyz · May 26, 2026, 07:41

@ramsey @ireneista @AVincentInSpace @intrbiz having very fine grained timing info lets you very precisely correlate messages across systems. Of the thousands of messages that went across the network in this second, despite any crypto, you can say *this* one was the one your subject sent, and *this* is where it went in the network, with a high degree of confidence. (Other attacks too, like fingerprinting the sender's clock, but they're a bit more involved.)

**Keith Wansbrough** @kw217@mathstodon.xyz · May 25, 2026, 23:13

@ramsey @intrbiz good ol' Microsoft "hectonanoseconds". Weird unit but presumably shoehorned into a particular bitwidth and range that made sense back in the day (Windows NT?)

**Ben Ramsey** @ramsey@phpc.social · May 25, 2026, 23:21

https://ioc.exchange/@peaceful_online/116637731391260403

@kw217 @intrbiz Probably much earlier than that. 😉

**Keith Wansbrough** @kw217@mathstodon.xyz · May 26, 2026, 07:37

@ramsey @intrbiz that article has the Apollo timestamp (v0) as 48 bits, which isn't wide enough for hectonanoseconds. Wikipedia agrees they came in with Windows in the nineties.
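The width argument can be checked with a few lines of Python arithmetic. This only computes how long a counter of 100-nanosecond ticks lasts at a given bit width; it says nothing about what unit the 48-bit field actually used.

```python
TICK_NS = 100                      # one "hectonanosecond" tick
NS_PER_DAY = 86_400 * 10**9
DAYS_PER_YEAR = 365.25


def span_days(bits):
    """Days covered by a counter of 100-ns ticks with the given bit width."""
    return (2 ** bits) * TICK_NS / NS_PER_DAY


days_48 = span_days(48)
years_60 = span_days(60) / DAYS_PER_YEAR
print(f"48 bits: about {days_48:.0f} days")
print(f"60 bits: about {years_60:.0f} years")
```

At 48 bits the counter wraps in under a year, while the 60-bit field used by version 1 covers several thousand years from 1582.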

**Ben Ramsey** @ramsey@phpc.social · May 26, 2026, 13:04

@kw217 @intrbiz You’re right. The article doesn’t say when they started using the Gregorian timestamp. I was making an assumption. I guess Microsoft was the first to do that?

**Keith Wansbrough** @kw217@mathstodon.xyz · May 26, 2026, 17:05
