[code talk] #atproto
I now have a snapshot dataset of ~4.8M CAR files, covering users that hit the new relay since it was started. I rigged up a node cluster (12 workers) that runs through the CAR files and dumps the follow graph into 12 different CSVs. Throughput bounces between 200 and 400 per second.
https://morel.us-east.host.bsky.network/xrpc/com.atproto.sync.getBlob?did=did:plc:ragtjsm2j2vknwkz3zp4oxrd&cid=bafkreib3qnlh4bydtfrxmiu4f76ygmqsuzelu2m7vihnwyvfawdg4cs6cy
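Rough sketch of the fan-out side, not the actual code: it assumes "node cluster" means Node.js, that files are dealt round-robin to the 12 workers, and that each worker appends `follower,followee` rows to its own CSV. Every name here (`shardFiles`, `toCsvRow`, `csvPathForWorker`, the `follows-N.csv` naming) is hypothetical, and CAR parsing is left out entirely.

```typescript
// Hypothetical helpers for illustration; none of these names come from the real pipeline.
const WORKER_COUNT = 12; // from the post: 12 workers, 12 CSVs

interface FollowEdge {
  follower: string; // DID of the repo owner (assumed)
  followee: string; // DID named in the follow record (assumed)
}

// Round-robin assignment: file i goes to worker i mod workerCount.
function workerForFile(fileIndex: number, workerCount: number = WORKER_COUNT): number {
  return fileIndex % workerCount;
}

// Split the full list of CAR file paths into one shard per worker.
function shardFiles(paths: string[], workerCount: number = WORKER_COUNT): string[][] {
  const shards: string[][] = Array.from({ length: workerCount }, () => []);
  paths.forEach((p, i) => shards[workerForFile(i, workerCount)].push(p));
  return shards;
}

// One CSV row per follow edge. No quoting, on the assumption that
// DIDs never contain commas or double quotes.
function toCsvRow(edge: FollowEdge): string {
  return `${edge.follower},${edge.followee}`;
}

// Each worker writes to its own file, so no cross-process locking is needed.
function csvPathForWorker(workerId: number): string {
  return `follows-${workerId}.csv`;
}

// Back-of-envelope run time in hours for a given file count and rate per second.
function etaHours(fileCount: number, filesPerSecond: number): number {
  return fileCount / filesPerSecond / 3600;
}
```

If the 200-400 figure is CAR files per second, `etaHours(4_800_000, 400)` and `etaHours(4_800_000, 200)` put a full pass at roughly 3.3 to 6.7 hours. One CSV per worker keeps the writers independent; the 12 files can be concatenated afterwards.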
