diff --git a/README.md b/README.md
index dcdcb0b..e8a6612 100644
--- a/README.md
+++ b/README.md
@@ -3,16 +3,13 @@
A KVCache-centric Disaggregated Architecture for LLM Serving
+ Paper
+ | Slides
+ | Traces
+ | Technical Report
-| [**Paper**](https://www.usenix.org/system/files/fast25-qin.pdf)
-| [**Slides**](https://www.usenix.org/system/files/fast25_slides-qin.pdf)
-| [**Traces**](FAST25-release/traces)
-| [**Technical Report**](https://arxiv.org/abs/2407.00079)
-|
-
-
Mooncake is the serving platform for
Kimi, a leading LLM service provided by
Moonshot AI.
Now both the Transfer Engine and Mooncake Store are open-sourced!
This repository also hosts its technical report and the open sourced traces.
@@ -70,9 +67,6 @@ With 40 GB of data (equivalent to the size of the KVCache generated by 128k toke
P2P Store is built on the Transfer Engine and supports sharing temporary objects between peer nodes in a cluster. P2P Store is ideal for scenarios like checkpoint transfer, where data needs to be rapidly and efficiently shared across a cluster.
**P2P Store has been used in the checkpoint transfer service of Moonshot AI.**
-### Mooncake Store ([Guide](doc/en/mooncake-store-preview.md))
-Mooncake Store is a distributed KVCache storage engine specialized for LLM inference. It offers object-level APIs (`Put`, `Get` and `Remove`), and we will soon release an new vLLM integration to demonstrate xPyD disaggregation. Mooncake Store is the central component of the KVCache-centric disaggregated architecture.
-
#### Highlights
- **Decentralized architecture.** P2P Store leverages a pure client-side architecture with global metadata managed by the etcd service.
@@ -83,6 +77,9 @@ Thanks to the high performance of Transfer Engine, P2P Stores can also distribut

+### Mooncake Store ([Guide](doc/en/mooncake-store-preview.md))
+Mooncake Store is a distributed KVCache storage engine specialized for LLM inference. It offers object-level APIs (`Put`, `Get` and `Remove`), and we will soon release a new vLLM integration to demonstrate xPyD disaggregation. Mooncake Store is the central component of the KVCache-centric disaggregated architecture.
+
### vLLM Integration ([Guide v0.2](doc/en/vllm-integration-v0.2.md))
To optimize LLM inference, the vLLM community is working on supporting [disaggregated prefilling (PR 10502)](https://github.com/vllm-project/vllm/pull/10502). This feature allows separating the **prefill** phase from the **decode** phase in different processes. The vLLM uses `nccl` and `gloo` as the transport layer by default, but currently it cannot efficiently decouple both phases in different machines.