Daily Issues and Solutions of a Nano-bioinformatic Research

1/3/2019 ● 4 minutes to read

The American cancer society has stated in 2019 that there are 1,762,450 new cancer cases and 606,880 cancer deaths during the first 2 quarters of the year 2019 alone. In a quick calculation, one can find it's 1660 new deaths per day on average. Therefore, cancer research in particular and new biological oriented problems globally are of first importance to humanity. This is a personal issue as well; statistics show that almost every person knows someone who knows someone with cancer. Do you?

Biological research of cancer uses techniques such as "deep sequencing" of a patient's genome which suffers from cancer; or "medical nanorobots" to find and treat cancer. One may ask how does computer science integrate with biological research. The answer lays in the field of nano-bioinformatics. As data collecting becomes more available and cheaper using new hardware solutions, and as the human population grows - the amount of data is overwhelming. A person is not yet able to grasp such amounts of data and therefore biological researchers put their fate in computers to handle this task and that is why programming is a new major tool in the toolbox of nowadays researchers. Furthermore, as the field of medical nanorobotics grows it is clearer than ever that researchers are able to design new solutions based on personalized medicine and computational theory to construct better treatments and diagnostics.

In short, the field of medical nanorobotics takes advantage of human-made swarms of particles that range in size from 1 nm to 100 nm that can cooperate together. Such swarm can be injected into a patient's bloodstream and uses targeted drug delivery to treat many kinds of illnesses. Not only, but newer research shows that such swarm of medical nanorobotics is programmable which opens new and exciting opportunities for treatments in the future. Nanorobotics and Bioinformatics are similar in the sense that both integrate computer science and biology in order to solve a few of the most important health problems of the 21st century. This is the reason for the increasing demand for specialized scientists in these fields.

Not only the academy sees interest in these subjects but also the market. Big companies such as DNAnexus which provides a cloud-based platform for analysis and management of large volumes of DNA sequence data; Seven Bridges Genomics which provides end-to-end bioinformatics solutions including access to datasets, analytic workflows, algorithms, cloud-computing infrastructure, and scientific support. Not only big and well-funded companies are repeating the gold rush but also startups. For example Israel, a country famous for its startups, is beginning to show first glances of nano-bioinformatics startups. One can mention Datos for its remote sensing wearables which analyzes important indices for patients with heart illnesses, or Dario which is trying to stabilize diabetes using dedicated hardware connected to a mobile phone that performs real-time analysis.

whether in an academy or the free market, as a nano-bioinformatics researcher you will handle the following issues in the journey achieving the desired solution:

Collecting data

Data in general and medical data, in particular, is precious and quite expensive. Such a phenomenon occurs because of hospitals, government offices, etc. that use not up to date systems and methods while collecting data. While there is indeed a lot of data available, it is often kept unpublished because of legal status.

Cleaning and organizing data

Well, congratulations, you have data! Now, when you look at the data, try not burst into tears. Usually, it takes much effort and time to clean up and organize the data in such a way to analyze it.description

Analyzing the data

The main problem that one encounters with big data is when trying to intuitively guess which part may be useful and how. Analyzing data is constructed from 2 smaller tasks. The first part is to find “interesting” relationships in the data. Such relationships between different parts of the data are the first building block for the second and harder task of finding patterns and generalizing them from just data to a hypothesis.

From findings to a product

A hypothesis is an important finding for itself. But, when we started this journey, we had an objective in mind. Now, using the hypothesis, we should develop a solution for our problem. When the data is the major part of the product but the human factor (UX\UI), the technical factor (computing power, costs) and others also come into play, can make this step very challenging.

As one can see, the process from achieving data and up to constructing a solution or a product is difficult and time-consuming. But no worry! We won’t leave you alone… here are a few global ideas for handling some of the many issues described.

A useful approach to find and collect data is by taking advantage of crowdsourcing platforms. Using a network of people around the world that contribute or generate data, is usually a fast and not too expensive option. Now, to be able to clean and organize the data there is no one correct way, but start by displaying the data in tables or graphs in order to externalize some of the data’s properties. Removing outliers and transfer range values to a more standard range (for example [0,1]).

Having cleaned the data, try and find a correlation between different parts of the data. A trick is to notice how a variable may affect the results, or maybe a combination of variables. Finally, reaching a product or a solution involves craftsmanship, innovation, creativity, and persistence.

"Success is not final; failure is not fatal: It is the courage to continue that counts." Winston S. Churchill

In conclusion, nano-bioinformatics is a promising area for research and development both in the academy and the market. This field uses a state of the art (SOTA) techniques from many disciplines like Mathematics, Computer Science, Biology, Chemistry, Information systems\theory, and others according to the problem one is trying to solve.