HaploThread User Manual (v1.0.0)

1 Preface

1.1 Overview

HaploThread is a cross-platform desktop software designed for constructing and visualizing haplotype networks. Developed in C++ using the Qt framework, it integrates multithreaded algorithms including McAN and fastHaN (which contains MSN, MJN, TCS, etc.) and adopts a plugin-based architecture for modular extension. HaploThread provides an intuitive graphical user interface (GUI) that allows users to rapidly build and interactively explore haplotype networks from large-scale sequence datasets, supporting genetic variation and evolutionary relationship analysis.

The software runs fully offline, eliminating network dependence and ensuring data privacy. HaploThread delivers high computational performance for large datasets (millions of sequences), catering to population genetics, pathogen evolution tracking, and epidemiological research.

1.2 Background and Motivation

Existing desktop tools for haplotype network construction (e.g., TCS, Network, PopART, HapStar) mostly rely on single-threaded algorithms, which cannot efficiently handle large-scale sequence data, and often lack interactive visualization. Recently developed multithreaded algorithms such as McAN and fastHaN significantly improve computational efficiency but lack an integrated desktop platform that combines high-performance algorithms with a user-friendly GUI.

HaploThread aims to provide a powerful and easy-to-use platform featuring:

  1. Intuitive visualization — interactive network graphs displaying haplotype evolutionary relationships.
  2. High computational efficiency — multithreaded parallel algorithms for large-scale datasets.
  3. Ease of use — GUI-based workflow without command-line operations.
  4. Security and reliability — fully local execution ensures data privacy.
  5. Extensibility — plugin-based architecture supporting algorithm and feature expansion.

1.3 Target Users

HaploThread is designed for researchers, data analysts, and educators in population genetics, molecular epidemiology, phylogenetics, pathogen surveillance and origin tracing.

1.4 Feature Summary

  1. Integrated algorithms: McAN, MJN, MSN, TCS, etc.
  2. Versatile formats: supports VCF, PHYLIP, GraphML I/O.
  3. Interactive visualization: geospatial and temporal dynamic rendering.
  4. Cross-platform: binaries for Windows & macOS.
  5. Open & extensible: plugin architecture and GPL open-source license.

2 Installation and Configuration

Download: Download the package HaploThread-1.0.0-win-x64.zip from: https://ngdc.cncb.ac.cn/biocode/tool/BT007948 or
https://github.com/git-xubo/HaploThread

Extract: Unzip the package into a directory with English characters only.

Run: No installation is required. Simply double-click: ...\HaploThread-1.0.0-win-x64\HaploThread.exe to launch the program (Figure 1).

Figure 1. Home page of HaploThread.

3 Basic Usage Examples

3.1 Constructing a Haplotype Network

3.1.1 Entering the Construction Page

Click the haplotype construction icon  on the home page (Figure 2) to enter the construction interface (Figure 3).

Figure 2. “haplotype construction” highlighted on the home page.

Figure 3. Haplotype construction interface.

3.1.2 Preparing Input Files

Download the example dataset Diamond_Princess_Data.zip from: https://ngdc.cncb.ac.cn/biocode/tool/BT007948

It includes:

  • Diamond_Princess.vcf: genomic variants
  • Diamond_Princess.tsv: metadata

3.1.3 Setting Parameters

Configure parameters under Algorithm, as shown in Figure 3 and Table 1.

Table 1. Parameter settings for haplotype network construction (McAN example)

Parameter

Description

Algorithm

Keep default (McAN)

Input VCF file

Diamond_Princess.vcf

Input meta file

Diamond_Princess.tsv

Output location

Directory for result files

Output files prefix

Prefix for output files

Number of threads

Set based on available resources

Minimal mutation rate

Set to 0 (no mutation filtering)

Infer community structure

Recommended to enable

3.1.4 Running the Construction

Click Run to start.

The constructed haplotype network appears on the right panel after completion (Figure 4).

Figure 4. Example of construction results.

3.2 Visualization

After construction, visualization settings can be adjusted in the Properties panel.

3.2.1 Node Coloring

Under Visual, select a coloring scheme.

  • By Country/Region — colors nodes by geographic origin (Figure 5). DP samples (from the Diamond Princess cruise ship) appear as a distinct lineage.

Figure 5. Coloring by geographic information.

  • By Month / By Year — colors nodes by sampling date (Figures 6 and 7).
  • By Group — colors nodes by detected clusters (Figure 8).

Figure 6. Haplotype network colored by month.

Figure 7. Haplotype network colored by year.

Figure 8. Haplotype network colored by cluster.

3.2.2 Hiding or Showing Nodes

In Graph, select Display by Day, then drag the slider to filter nodes by time (Figure 9).

Figure 9. Haplotype network showing or hiding nodes based on time.

3.2.3 Manual Layout Adjustment

Click the pointer  icon to switch to dragging mode and manually adjust node positions.

3.2.4 Displaying Haplotypes

Figure 10. Displaying the bases of a haplotype.

3.3 Exporting the Haplotype Network

Click Export to save the network visualization. Supported formats:

  • SVG
  • PNG
  • JPG

GraphML output is saved automatically under the output directory as: run1.haplonet.graphml (if the prefix is run1).

3.4 Visualizing a Pre-built Network

To visualize an existing GraphML haplotype network: Click Haplotype visualization  on the home page (Figure 11). Go to File → Open… (Figure 12) and load a .graphml file. The same visualization features become available (Figure 13).

Figure 11. The haplotype visualization icon is highlighted on the HaploThread homepage.

Figure 12. Opening a GraphML file in the visualization module.

Figure 13. Software screenshot after opening a GraphML file in the visualization module.

4 FAQ

4.1 Missing VCRUNTIME140.dll / VCRUNTIME140_1.dll / MSVCP140.dll

Error message:

VCRUNTIME140.dll, VCRUNTIME140_1.dll, or MSVCP140.dll not found.

Cause: Missing Microsoft Visual C++ runtime libraries.

Solution:

Install the required Microsoft Visual C++ Redistributable.

A download link is provided in the Dependencies section of this manual.

Restart HaploThread after installation.

5 How to Cite

If HaploThread is used in your research, please cite:

BibTex:

@article {Xu2025.07.06.659816,

        author = {Xu, Bo and Li, Lun and Li, Cuiping and Wang, Anke and Fan, Zhuojing and Song, Shuhui},

        title = {HaploThread: A Scalable Integrated Desktop Platform for Constructing and Visualizing Haplotype Networks for Large-sample Sequences},

        elocation-id = {2025.07.06.659816},

        year = {2025},

        doi = {10.1101/2025.07.06.659816},

        publisher = {Cold Spring Harbor Laboratory},

        URL = {https://www.biorxiv.org/content/early/2025/07/10/2025.07.06.659816},

        eprint = {https://www.biorxiv.org/content/early/2025/07/10/2025.07.06.659816.full.pdf},

        journal = {bioRxiv}

}

Plain text:

Xu B, Li L, Li C, Wang A, Fan Z, Song S. (2025). HaploThread: A Scalable Integrated Desktop Platform for Constructing and Visualizing Haplotype Networks for Large-sample Sequences. bioRxiv, 2025.07.06.659816. doi:10.1101/2025.07.06.659816

Available at: https://www.biorxiv.org/content/early/2025/07/10/2025.07.06.659816

You may also cite the software version:

HaploThread v1.0.0, National Center for Bioinformation, Beijing

Download: https://ngdc.cncb.ac.cn/biocode/tool/BT007948

6 References

  1. Bandelt HJ, Forster P, Röhl A. 1999. Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol 16:37-48.
  2. Chi L, Dong Y, Wang R, Yang S, Wu L, Xue Y, Chen H. 2025. HapNetworkView: a tool for haplotype network exploration and visualization. BMC Genomics 26:52.
  3. Chi L, Zhang X, Xue Y, Chen H. 2023. fastHaN: a fast and scalable program for constructing haplotype network for large-sample sequences. Mol Ecol Resour.
  4. Clement M, Posada D, Crandall KA. 2000. TCS: a computer program to estimate gene genealogies. Mol Ecol 9:1657-1659.
  5. Excoffier L, Smouse PE. 1994. Using allele frequencies and geographic subdivision to reconstruct gene trees within a species: molecular variance parsimony. Genetics 136:343-359.
  6. Kumar S, Dudley J. 2007. Bioinformatics software for biologists in the genomics era. Bioinformatics 23:1713-1717.
  7. Leigh JW, Bryant D. 2015. popart: full-feature software for haplotype network construction. Methods in Ecology and Evolution 6:1110-1116.
  8. Li L, Li C, Li N, Zou D, Zhao W, Luo H, Xue Y, Zhang Z, Bao Y, Song S. 2024. Machine Learning Early Detection of SARS-CoV-2 High-Risk Variants. Adv Sci (Weinh) 11:e2405058.
  9. Li L, Xu B, Tian D, Wang A, Zhu J, Li C, Li N, Zhao W, Shi L, Xue Y, et al. 2023. McAN: a novel computational algorithm and platform for constructing and visualizing haplotype networks. Brief Bioinform 24.
  10. Matschiner M. 2016. Fitchi: haplotype genealogy graphs based on the Fitch algorithm. Bioinformatics 32:1250-1252.
  11. Song S, Ma L, Zou D, Tian D, Li C, Zhu J, Chen M, Wang A, Ma Y, Li M, et al. 2020. The Global Landscape of SARS-CoV-2 Genomes, Variants, and Haplotypes in 2019nCoVR. Genomics Proteomics Bioinformatics 18:749-759.
  12. Teacher AG, Griffiths DJ. 2011. HapStar: automated haplotype network layout and visualization. Mol Ecol Resour 11:151-153.
  13. Templeton AR, Crandall KA, Sing CF. 1992. A cladistic analysis of phenotypic associations with haplotypes inferred from restriction endonuclease mapping and DNA sequence data. III. Cladogram estimation. Genetics 132:619-633.
  14. Vences M, Patmanidis S, Schmidt J-C, Matschiner M, Miralles A, Renner SS. 2024. Hapsolutely: a user-friendly tool integrating haplotype phasing, network construction, and haploweb calculation. Bioinformatics Advances 4.

7 Contributor Statement

Name

Contribution

Shuhui Song

Overall project supervision

Bo Xu

Architecture design, implementation, testing, documentation

Lun Li

Algorithm implementation, testing, documentation

Cuiping Li

Testing, documentation

Anke Wang

Visualization implementation

Zhuojing Fan

Logo & icon design

8 Acknowledgements

This work was supported by:

  • ANSO Collaborative Research Program (Grant No. ANSO-CR-KP-2022-09)
  • National Natural Science Foundation of China (Grant No. 32270718, 32170678)

We thank CNCB and NGDC for technical support, as well as all colleagues who provided valuable feedback during development and testing.

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值