152-midpoint-insertion-caching-strategy

How MySQL Avoids Performance Hits from Table Scans

Source: https://arpitbhayani.me/blogs/midpoint-insertion-caching-strategy Date: 2020-04-26

Explore MySQL InnoDB's buffer pool and its midpoint insertion strategy for efficient cache management and scan resistance.

Disk reads are 4x (for SSD) to 80x (for magnetic disk) slower as compared to main memory (RAM) reads and hence it becomes extremely important for a database to utilize main memory as much as it can, and be super-performant while keeping its latencies to a bare minimum. Engines cannot simply replace disks with RAM because of volatility and cost, hence it needs to strike a balance between the two - maximize main-memory utilization and minimize the disk access.

The database engine virtually splits the data files into pages. A page is a unit which represents how much data the engine transfers at any one time between the disk (the data files) and the main memory. It is usually a few kilobytes 4KB, 8KB, 16KB, 32KB, etc. and is configurable via engine parameters. Because of its bulky size, a page can hold one or multiple rows of a table depending on how much data is in each row i.e. the length of the row.

How MySQL Avoids Performance Hits from Table Scans

Source: https://arpitbhayani.me/blogs/midpoint-insertion-caching-strategy Date: 2020-04-26

Explore MySQL InnoDB's buffer pool and its midpoint insertion strategy for efficient cache management and scan resistance.

How MySQL Avoids Performance Hits from Table Scans

152-midpoint-insertion-caching-strategy

How MySQL Avoids Performance Hits from Table Scans

Locality of reference

Spatial Locality of Reference

Temporal Locality of Reference

LRU Cache

Implementation

InnoDB’s Buffer Pool

A notorious problem with Sequential Scans

Midpoint Insertion Strategy

Eviction

Insertion

Moving page from Old to the Young sublist

MySQL parameter to tune the midpoint

Conclusion

References