A reliable file storage system inspired by enterprise solutions that automatically stores backup copies of files across multiple computers. When one storage node fails, the system detects this and ensures files remain accessible. Built as a learning project to demonstrate how large-scale storage systems maintain data reliability.
Key features:
- Stores files reliably with automatic backup copies across multiple storage nodes
- Self-healing system that detects when a storage node fails and creates new backup copies
- Simple API for uploading, downloading, and managing files
Requirements:
- Docker
- Go 1.23
- k3d
- kubectl
brew install k3d
brew install kubectl
Then you can read through deploy.sh, or just run:
chmod +x deploy.sh # to make it executable
bash deploy.sh
# You may need to run the following to enable port forwarding if not already enabled.
kubectl port-forward service/client 8080:80
This should create the required k3d cluster, build all the Docker images, and deploy everything to the cluster. You should be able to go to http://localhost:8080/ and see this:
File Upload Demo:
Screen.Recording.2025-03-30.at.5.46.24.PM.mov
Upload a file to the distributed file system.
URL: /upload
Method: POST
Content-Type: multipart/form-data
Form Parameters:
| Parameter | Type | Required | Description |
|---|---|---|---|
| file | File | Yes | The file to upload |
Response:
{
"hash": "string"
}
Error Response:
{
"error": "string"
}
Example:
curl -X POST \
http://localhost:8080/upload \
-F 'file=@/path/to/your/file.txt'
Download a file from the distributed file system using its hash.
URL: /download
Method: GET
Query Parameters:
| Parameter | Type | Required | Description |
|---|---|---|---|
| hash | string | Yes | The hash identifier of the file to download |
Response:
The binary content of the requested file.
Error Response:
{
"error": "string"
}
Example:
curl -X GET \
'http://localhost:8080/download?hash=abc123' \
--output downloaded_file.txt
Retrieve information about the distributed file system.
URL: /info
Method: GET
Response:
{
"service": "Distributed File System",
"status": "running",
"namenode": "string"
}
Example:
curl -X GET http://localhost:8080/info
CAS (Content Addressable Storage) is a type of file system that stores data based on its content, rather than its location. This allows for efficient retrieval and deduplication of data. This is all implemented in the store/storage.go file, where the Store type provides concurrency-safe methods for interacting with the storage system.
// Example of writing a file
file, err := os.Open("my/file")
if err != nil {
	log.Fatal(err)
}
defer file.Close()
hash, err := store.Write(file)
The hash of a file might look something like: c0c61d6932fc8436ec0a8536ed0f191a0ef8b5e7eec9ac3c2657a7ce319388dc
Internally, this is broken down depending on the block size. With a block size of 5, the actual file path may look something like: root/c0c61/d6932/fc843/6ec0a/8536e/d0f19/1a0ef/8b5e7/eec9a/c3c26/57a7c/e319388dc/c0c61d6932fc8436ec0a8536ed0f191a0ef8b5e7eec9ac3c2657a7ce319388dc
where the hash of the file itself is chunked to create the nested folders.
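As an illustration, here is a minimal sketch of how that path transform could work. The function name and signature are assumptions for this example, not the actual code in store/storage.go:

```go
package main

import (
	"fmt"
	"path/filepath"
)

// hashToPath is a hypothetical helper showing one way to derive the nested
// folder layout above: the hash is split into blockSize-sized folder names,
// the last folder absorbs any remainder, and the full hash is used as the
// file name.
func hashToPath(root, hash string, blockSize int) string {
	numFolders := len(hash) / blockSize
	parts := make([]string, 0, numFolders+1)
	for i := 0; i < numFolders; i++ {
		start := i * blockSize
		end := start + blockSize
		if i == numFolders-1 {
			end = len(hash) // last folder absorbs the remainder
		}
		parts = append(parts, hash[start:end])
	}
	parts = append(parts, hash) // full hash as the file name
	return filepath.Join(root, filepath.Join(parts...))
}

func main() {
	hash := "c0c61d6932fc8436ec0a8536ed0f191a0ef8b5e7eec9ac3c2657a7ce319388dc"
	fmt.Println(hashToPath("root", hash, 5))
	// root/c0c61/d6932/.../57a7c/e319388dc/c0c61d6932...388dc
}
```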
richardstore is similar to HDFS. It has a namenode and a series of datanodes. The namenode manages the file system metadata: things like the location of files, which datanodes hold which files, the health of datanodes, etc. The datanodes store the actual files (blocks in HDFS). All inter-system communication is done with gRPC, and the client exposes a REST HTTP endpoint for ease of use. You can find the full HDFS architecture guide here.
At a high level, this is what richardstore looks like:
The datanode uses heartbeats to inform the namenode that it is alive and can handle read traffic. Heartbeats are also used to transfer commands between the two nodes. If a datanode doesn't send a heartbeat within a specified interval, the namenode will mark that node as dead and will not direct any traffic to it until it starts sending heartbeats again. This, together with file replication, allows for a certain level of fault tolerance: if a node is dead, other nodes can serve the same data and cover for it until the node is back up. The namenode can also issue replication commands to datanodes. For example, if a file is held by 3/5 nodes in the file system and 2 of those nodes go down, the namenode can detect this and, when the last remaining node sends a heartbeat, issue a replication command to duplicate the file to 2 other nodes in the file system, ensuring that the file is still held by at least 3/5 nodes in the system.
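As a rough sketch of the liveness-tracking side of this (the type and field names here are assumptions, not the actual richardstore code):

```go
package namenode

import (
	"sync"
	"time"
)

// NameNode tracks when each datanode last checked in. A node that hasn't
// heartbeated within deadAfter is treated as dead and excluded from reads.
type NameNode struct {
	mu            sync.Mutex
	lastHeartbeat map[string]time.Time // datanode address -> last heartbeat
	deadAfter     time.Duration
}

func NewNameNode(deadAfter time.Duration) *NameNode {
	return &NameNode{
		lastHeartbeat: make(map[string]time.Time),
		deadAfter:     deadAfter,
	}
}

// RecordHeartbeat is called whenever a datanode checks in.
func (n *NameNode) RecordHeartbeat(addr string) {
	n.mu.Lock()
	defer n.mu.Unlock()
	n.lastHeartbeat[addr] = time.Now()
}

// AliveNodes returns the datanodes that have heartbeated recently enough to
// be considered alive; only these are eligible for read traffic.
func (n *NameNode) AliveNodes() []string {
	n.mu.Lock()
	defer n.mu.Unlock()
	var alive []string
	for addr, seen := range n.lastHeartbeat {
		if time.Since(seen) < n.deadAfter {
			alive = append(alive, addr)
		}
	}
	return alive
}
```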
Block reports are what the datanodes use to tell the namenode which files they have. Upon receiving one, the namenode updates its internal map of file -> datanodes for future read operations. A block report is basically just a list of all of the files that a datanode has. An incremental block report is essentially a compressed version of the block report which only tracks the changes that have happened on that datanode's file system (adding or deleting a file).
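The bookkeeping on the namenode side might look roughly like this; a sketch under assumed names, not the actual implementation:

```go
package namenode

import "sync"

// fileMap tracks which datanodes hold which file hashes. A full block report
// replaces everything known about a datanode; an incremental report only
// applies the listed additions and deletions.
type fileMap struct {
	mu    sync.Mutex
	files map[string]map[string]bool // file hash -> set of datanode addresses
}

func newFileMap() *fileMap {
	return &fileMap{files: make(map[string]map[string]bool)}
}

func (m *fileMap) applyFullReport(datanode string, hashes []string) {
	m.mu.Lock()
	defer m.mu.Unlock()
	// Drop any previous entries for this datanode, then re-add from the report.
	for _, nodes := range m.files {
		delete(nodes, datanode)
	}
	for _, h := range hashes {
		if m.files[h] == nil {
			m.files[h] = map[string]bool{}
		}
		m.files[h][datanode] = true
	}
}

func (m *fileMap) applyIncremental(datanode string, added, removed []string) {
	m.mu.Lock()
	defer m.mu.Unlock()
	for _, h := range added {
		if m.files[h] == nil {
			m.files[h] = map[string]bool{}
		}
		m.files[h][datanode] = true
	}
	for _, h := range removed {
		if nodes := m.files[h]; nodes != nil {
			delete(nodes, datanode)
		}
	}
}
```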
To write a file, the client first asks the namenode where to write the file to. The namenode will allocate space on the file system and return the addresses of the datanodes to write to. The client then directly writes to the first datanode of that list.
message FileStream {
  oneof type {
    StreamInfo stream_info = 1;
    bytes chunk = 2;
  }
}

message StreamInfo {
  FileInfo file_info = 1;
  repeated DataNodeInfo data_nodes = 2;
}
You can see that in the StreamInfo, we can specify a list of datanodes that we want this file replicated to. The DataNode will write the file to its file system, and then initiate the replication chain if needed. The client only gets a success once all nodes in the chain have confirmed that they have successfully written the file.
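Roughly, the forwarding logic on a datanode could look like the sketch below. It buffers the whole file for simplicity and uses assumed names, whereas the real system streams chunks over gRPC:

```go
package datanode

import (
	"bytes"
	"fmt"
	"io"
)

// Store is a minimal stand-in for the CAS store described earlier.
type Store interface {
	Write(r io.Reader) (hash string, err error)
}

// forward is a stand-in for streaming the file (and the remaining chain) to
// the next datanode over gRPC.
type forward func(addr string, data []byte, rest []string) error

// writeAndForward stores the file locally, then forwards it to the next
// datanode in the replication chain. It only returns success once the rest of
// the chain has acknowledged, which is how the client ends up waiting for
// every replica.
func writeAndForward(local Store, send forward, data []byte, chain []string) error {
	if _, err := local.Write(bytes.NewReader(data)); err != nil {
		return fmt.Errorf("local write failed: %w", err)
	}
	if len(chain) == 0 {
		return nil // end of the chain; the ack travels back up to the client
	}
	next, rest := chain[0], chain[1:]
	if err := send(next, data, rest); err != nil {
		return fmt.Errorf("replication to %s failed: %w", next, err)
	}
	return nil
}
```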
Now, after writing the file directly to the datanodes, it isn't immediately readable, because the namenode is only aware of the intention to write the file; it doesn't know which datanodes actually have it yet. Eventually, after some short interval, the datanodes will all send either a BlockReport or an IncrementalBlockReport to the namenode. This makes the namenode aware that the datanode is holding the file, and the file becomes readable from that point.
To read a file, the client provides the namenode with a hash, which is looked up against the namenode's file map to check which replicas have the file. The namenode then returns the list of replicas, which the client can read from directly. The client currently reads sequentially down the list until a successful read is made; an alternative would be to try all connections concurrently and cancel the others once one read succeeds.
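A sketch of that sequential fallback might look like this (hypothetical names; the real client talks to datanodes over gRPC):

```go
package client

import (
	"errors"
	"fmt"
)

// readFromReplica is a stand-in for fetching a file's bytes from one datanode.
type readFromReplica func(addr, hash string) ([]byte, error)

// readFirstAvailable walks the replica list returned by the namenode and
// returns the first successful read, falling through to the next replica on
// failure. This mirrors the sequential strategy described above.
func readFirstAvailable(read readFromReplica, replicas []string, hash string) ([]byte, error) {
	var lastErr error
	for _, addr := range replicas {
		data, err := read(addr, hash)
		if err == nil {
			return data, nil
		}
		lastErr = fmt.Errorf("replica %s: %w", addr, err)
	}
	if lastErr == nil {
		lastErr = errors.New("no replicas available")
	}
	return nil, lastErr
}
```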
To delete a file, the namenode propagates a delete command to all datanodes during their next heartbeat. The namenode also has to track a list of recently deleted files. This is because there's a possibility that a datanode will report the deleted file back in a block report (if it hasn't seen the delete command yet), and without that list the namenode would not be able to differentiate between "the datanode has a new file" and "the datanode still has a deleted file".
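For example, the block report handling might consult that recently-deleted list along these lines (an illustrative sketch with assumed names):

```go
package namenode

// handleReportedHash decides what to do when a block report claims a datanode
// holds a given hash. If the hash was recently deleted, the report does not
// re-register the file; instead the datanode should be told to delete it on a
// later heartbeat.
func handleReportedHash(recentlyDeleted map[string]bool, hash string) (register bool, reDelete bool) {
	if recentlyDeleted[hash] {
		// The datanode just hasn't processed the delete command yet.
		return false, true
	}
	// Genuinely new (or still-valid) file: record it in the file map.
	return true, false
}
```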
Building all of HDFS would have taken me forever. For one, I didn't implement blocks, which are a core part of HDFS. I was already toying around with building a CAS file system, and reusing it made my life way easier. There are also some durability and auth features that I opted not to build.