product updates, company news, and insights on building and optimizing your data pipelines.
Providing POSIX semantics on top of object storage is an old and messy problem. Here's what's possible, what's impossible, and what ML teams should actually demand from a storage layer.
Searching for 'mount S3 as NFS' turns up a dozen FUSE-based tools. Here's why none of them survive production ML workloads, and what actually works.
EFS gives you POSIX on AWS. Training Pipes gives you POSIX plus caching plus any cloud. We ran the numbers on a realistic ML training workload to see which wins.
s3fs-fuse is a fine prototype tool and a dangerous production dependency. Here's what breaks, why, and what to use instead for real ML training workloads.
NFS and S3 solve different problems, but AI teams have to use both. Here's a clear framework for when each protocol wins, and how to stop choosing between them.