IPFS multi-sources pull issue / scaling issue #3802
Labels
need/analysis
Needs further analysis before proceeding
status/in-progress
In progress
topic/perf
Performance
Milestone
Version information:
go-ipfs version: 0.4.6-
Repo version: 5
System version: amd64/linux
Golang version: go1.8
Type:
Probably Bug
Priority:
P2
Description:
I was playing with IPFS and I feel that I discovered a strange behavior! I have multiple machines, let's says 4, on the first one, I generated a random binary file and I added it to IPFS. On the second node, I pulled this file by its multihash. When the data is completely transferred to the second node, I repeated the same steps again on the third node, fourth node...
What I was excepting is that the time needed to pull the image for each node to be less than the time need by the previous ones, because the last started node has more sources that can supply the data and the node can pull the data in parallel. But what I experienced was different. Each new node takes more time than the previous ones! The reason is caused by the increasing amount of data sent over the network. It seems that the file is being pulled from all the other nodes. I verified that by observing the data size passing through the network interface.
To replicate the experiment or to see the results, please refer to this repository https://github.com/jad-darrous/IPFS-multi-sources-pull-issue
The text was updated successfully, but these errors were encountered: