
Flowchart of CancerLocator. Step 1: A set of solid tumor samples and healthy plasma samples collected from public databases and the literature are used to select the informative features (CpG clusters) that can differentiate tumor types or healthy plasma samples. Then the beta distributions of the methylation levels of these selected features for each tumor type or healthy plasma samples are learnt. Step 2: Given a plasma sample, the methylation profile of its cfDNAs is measured by whole-genome bisulfite sequencing, which is then used as input for cancer location prediction by CancerLocator
US Researchers have developed a computer program that can simultaneously detect cancer and identify where in the body the cancer is located, from a patient’s blood sample. Professor Jasmine Zhou, ULCA, said: “Non-invasive diagnosis of cancer is important, as it allows the early diagnosis of cancer, and the earlier the cancer is caught, the higher chance a patient has of beating the disease. We have developed a computer-driven test that can detect cancer, and also identify the type of cancer, from a single blood sample. The technology is in its infancy and requires further validation, but the potential benefits to patients are huge.”
The program works by looking for specific molecular patterns in cancer DNA that is free flowing in the patients’ blood and comparing the patterns against a database of tumour epigenetics, from different cancer types, collated by the authors. DNA from tumour cells is known to end up in the bloodstream in the earliest stages of cancer so offers a unique target for early detection of the disease.
Professor Zhou explained: “We built a database of epigenetic markers, specifically methylation patterns, which are common across many types of cancer and also specific to cancers originating from specific tissue, such as the lung or liver. We also compiled the same ‘molecular footprint’ for non-cancerous samples so we had a baseline footprint to compare the cancer samples against. These markers can be used to deconvolute the DNA found freely in the blood into tumor DNA and non-tumor DNA.”
In this study, the new computer program and two other methods (Random Forest and Support Vector Machine) were tested with blood samples from 29 liver cancer patients, 12 lung cancer patients and 5 breast cancer patients. Tests were run 10 times on each sample to validate the results. The Random Forest and Support Vector Machine methods had an overall error rate (the chance that the test produces a false positive) of 0.646 and 0.604 respectively, while the new program obtained a lower error rate of 0.265.
25 out of the 29 liver cancer patients and 5 out of 12 lung cancer patients tested in this study had early stage cancers, which the program was able to detect in 80% of cases. Although the level of tumour DNA present in the blood is much lower during the early stages of these cancers, the program was still able to make a diagnosis demonstrating the potential of this method for the early detection of cancer, according to the researchers.
Professor Zhou added: “Owing to the limited number of blood samples, the results of this study are evaluated only on 3 cancer types (breast, liver and lung). In general, the higher the fraction of tumor DNAs in blood, the more accurate the program was at producing a diagnostic result. Therefore, tumors in well-circulated organs, such as the liver or lungs are easier to diagnose early using this approach, than in less-circulated organs such as the breast.” http://genomebiology.biomedcentral.com/articles/10.1186/s13059-017-1191-5 https://eurekalert.org/pub_releases/2017-03/bc-cpd032117.php




Recent Comments