Haystack-based Facebook’s data storage architecture: store, directory, and cache

Tole Sutikno, Ahmad Heryanto, Laksana Talenta Ahmad

Abstract


Haystack is Facebook's unique way of managing large amounts of user-generated content like photos. The architecture prioritizes performance, reliability, and scalability to overcome network-attached storage system bottlenecks. Haystack speeds data access and ensures data integrity during hardware failures by using physical and logical volumes. This study examines the architecture of Facebook's Haystack data storage system and its effects on scalability and efficiency in handling large photo data. According to the study, the store, directory, and cache functions work together to reduce input/output (I/O) operations and improve metadata processing, which traditional network-attached storage systems cannot do. Haystack manages massive photo data storage and retrieval, solving network-attached storage (NAS) limitations. It balances throughput and latency by minimizing disk operations and optimizing metadata processing. Each store, directory, and cache contribute to this ecosystem. The Haystack architecture reduces disk operations and metadata processing bottlenecks with distributed caching. A cache allows instant access to frequently requested images and balances read and write operations across the system. We should study advanced storage system architectures based on Facebook's Haystack architecture. This could involve investigating faster metadata processing algorithms, using artificial intelligence (AI) to improve fault detection and repair systems, and assessing the economic impact of distributed caches.

Full Text:

PDF


DOI: http://doi.org/10.11591/ijaas.v14.i3.pp671-681

Refbacks

  • There are currently no refbacks.


Copyright (c) 2025 Tole Sutikno, Ahmad Heryanto, Laksana Talenta Ahmad

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

View the IJAAS Visitor Statistics

International Journal of Advances in Applied Sciences (IJAAS)
p-ISSN 2252-8814, e-ISSN 2722-2594
This journal is published by Intelektual Pustaka Media Utama (IPMU) in collaboration with the Institute of Advanced Engineering and Science (IAES).