New top story on Hacker News: Ask HN: Books on designing disk-optimized data structures?

Ask HN: Books on designing disk-optimized data structures?
13 by memset | 13 comments on Hacker News.
Are there canonical books, resources, or readings for how to design data structures that will be primarily read and written to a disk rather than memory? Most of what I learned in school about big-O assumes that, for example, random access is O(1). However, random disk reads are really slow due to spacial locality. People who write databases obviously have solutions to this problem - for example, DuckDB is based on a number of papers that have come out over the years on this topic. If I wanted to design, ie, a tree structure which was intended to be read/written from a disk, are there general principles or patterns the have been developed to take advantage of locality of reference, minimize random reads, or decrease the overhead of writes, that I could familiarize myself with? What is the CLRS for disk?

Comments

Popular posts from this blog

Northeastern US braces for foot of snow during first days of December

North Korea test fires two missiles month before deadline for US to respond on talks

Democratic debate winners and losers: Elizabeth Warren triumphs while Beto O'Rourke flounders