[HTML][HTML] Large-scale network analysis reveals the sequence space architecture of antibody repertoires

E Miho, R Roškar, V Greiff, ST Reddy - Nature communications, 2019 - nature.com
Nature communications, 2019nature.com
The architecture of mouse and human antibody repertoires is defined by the sequence
similarity networks of the clones that compose them. The major principles that define the
architecture of antibody repertoires have remained largely unknown. Here, we establish a
high-performance computing platform to construct large-scale networks from comprehensive
human and murine antibody repertoire sequencing datasets (> 100,000 unique sequences).
Leveraging a network-based statistical framework, we identify three fundamental principles …
Abstract
The architecture of mouse and human antibody repertoires is defined by the sequence similarity networks of the clones that compose them. The major principles that define the architecture of antibody repertoires have remained largely unknown. Here, we establish a high-performance computing platform to construct large-scale networks from comprehensive human and murine antibody repertoire sequencing datasets (>100,000 unique sequences). Leveraging a network-based statistical framework, we identify three fundamental principles of antibody repertoire architecture: reproducibility, robustness and redundancy. Antibody repertoire networks are highly reproducible across individuals despite high antibody sequence dissimilarity. The architecture of antibody repertoires is robust to the removal of up to 50–90% of randomly selected clones, but fragile to the removal of public clones shared among individuals. Finally, repertoire architecture is intrinsically redundant. Our analysis provides guidelines for the large-scale network analysis of immune repertoires and may be used in the future to define disease-associated and synthetic repertoires.
nature.com