bioinformatics

Two years of research create lasting impact for Nigerian visiting scholar

June 25, 2024 By Rubinstein, Sarah (MU-Student) in #IAmScience Tags: bioinformatics, Bond Life Sciences Center, research, University of Missouri

By Sarah Rubinstein | Bond LSC

Michael Arowolo is a visiting professor in the lab of Dong Xu, a Bond LSC principal investigator. | photo by Braiden Wade, Bond LSC

The African proverb “it takes a village to raise a child” can especially apply in science where that village includes mentors like Dong Xu, a Bond Life Sciences Center principal investigator, who has trained hundreds of students and collaborators.

Michael Arowolo is among those mentored, having spent the past two years in Xu’s lab as a visiting scholar. In August, he will take that experience with him to Xavier University of Louisiana as a new assistant professor.

“Dr. Xu has really impacted me in so many facets of my life,” said Arowolo. “He has given me the privilege of tapping into his knowledge.”

The Nigerian native landed at Mizzou because around 40 scientific publications credited to him caught Xu’s attention, and he was invited to be a visiting professor in July 2022. He quickly stood out amongst other people Xu had worked with.

“In Africa, the research infrastructure is not as advanced as the U.S. In that environment, if he could publish that many papers, I was very impressed,” said Xu, also head of Mizzou’s Digital Biology lab. “He’s a self-starter, he actually takes initiative. I don’t need to motivate him to work hard.”

In Nigeria, where he earned his Ph.D., Arowolo scrambled between roles as a lecturer, researcher, exam officer and more. Working at Bond LSC was a change of environment that he was excited to take on. He has made it a practice to come into the lab around 7:30 a.m. and leave around 5 p.m. to meet his work needs. He spends most of his time working on his computer, coding.

While visiting professors typically stay less than a year, Arowolo’s stay was extended because he felt he still had more to contribute.

Arowolo’s research focus at Mizzou has been discovering innovations in biological pathways. To break it down, he collects information on gene interactions into a database and uses artificial intelligence to develop models, such as a Siamese neural network, for identifying relevant genes. That database can give scientists access to a multitude of information in one place to create targeted treatments and drugs for diseases.
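
As a rough illustration of the Siamese idea, the sketch below passes two inputs through the same encoder (shared weights) and compares the resulting embeddings. The weights, feature vectors and gene labels are made up for illustration, and nothing here reflects the lab's actual model or data.

```python
import math

# Hypothetical shared weights: a Siamese network applies the SAME encoder to both inputs.
WEIGHTS = [[0.5, -0.2, 0.1],
           [0.3, 0.8, -0.5]]

def encode(features):
    """Map a feature vector to an embedding with one tanh layer (shared weights)."""
    return [math.tanh(sum(w * x for w, x in zip(row, features))) for row in WEIGHTS]

def cosine_similarity(a, b):
    """Cosine similarity between two embeddings."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def siamese_score(features_a, features_b):
    """Encode both inputs with the same weights, then compare the embeddings."""
    return cosine_similarity(encode(features_a), encode(features_b))

# Made-up feature vectors for three hypothetical genes.
gene_a = [1.0, 0.2, 0.0]
gene_b = [0.9, 0.3, 0.1]
gene_c = [-1.0, 0.5, 2.0]

print(siamese_score(gene_a, gene_b))  # similar inputs score higher
print(siamese_score(gene_a, gene_c))  # dissimilar inputs score lower
```

In a trained network the weights would be learned so that related genes land close together; here they are fixed constants purely to show the shared-encoder structure.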

“They won’t need to go through the back end and stress themselves with ‘What is all this computational jargon, what is all this coding?’” he said. “They can have a platform that they can easily interact with…and get the results they need to get.”

AI has become a major part of Arowolo’s work. He is developing his own large language model, using an advanced retrieval-augmented generation model that identifies pertinent genes and describes their relationships with specific biological processes in human cells.
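
Retrieval-augmented generation generally means fetching relevant reference text first and then handing it to a language model along with the question. The sketch below shows only that retrieval-and-prompt step, using simple word overlap. The gene notes, the scoring and the prompt format are illustrative assumptions rather than Arowolo's system, and no real model is called.

```python
# Hypothetical mini knowledge base (illustrative text, not real curated annotations).
NOTES = {
    "GENE_A": "GENE_A is involved in dna repair and cell cycle arrest",
    "GENE_B": "GENE_B regulates glucose uptake in muscle cells",
    "GENE_C": "GENE_C supports dna replication during cell division",
}

def tokenize(text):
    """Lowercase a string and split it into a set of words."""
    return set(text.lower().split())

def retrieve(question, k=2):
    """Return the k (name, note) pairs sharing the most words with the question."""
    q = tokenize(question)
    ranked = sorted(NOTES.items(),
                    key=lambda item: len(q & tokenize(item[1])),
                    reverse=True)
    return ranked[:k]

def build_prompt(question, k=2):
    """Assemble the retrieved notes plus the question into one prompt string."""
    context = "\n".join(f"- {text}" for _, text in retrieve(question, k))
    return f"Context:\n{context}\n\nQuestion: {question}"

print(build_prompt("which genes take part in dna repair", k=1))
```

A real system would typically replace the word-overlap score with learned embeddings and pass the prompt to a language model, but the retrieve-then-ask structure stays the same.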

He recognizes how AI has been taking the world by storm, and he wants to use it to help people.

“Instead of just thinking that the world is over, that AI will take over, before AI takes over, we will tap into AI and be the speaker for AI,” he said.

Arowolo has also expanded past his computational work and has been collaborating with the Mizzou School of Medicine on a new project. His team proposed a medication dispensing machine that would help Alzheimer’s patients. His team is currently in the process of developing a product sellable to big companies like Amazon, he said.

“He transformed academic work into a commercial product,” Xu said.

But, on top of his research, Arowolo teaches undergraduate and graduate students and mentors Ph.D. students.

“It’s become a passion,” he said. “I’ve mentored over 100 students, and they are also doing well in their area of endeavor, most of them in Dr. Xu’s lab.”

Now, Arowolo will pass down what he’s learned from Dr. Xu to his new students at Xavier University of Louisiana in August.

Xu, on the other hand, is excited to see Arowolo take on this next step.

“I think the main reason you train people is not only so they will create a product but we hope to help them move onto a more independent position with a higher salary,” Xu said.

When Arowolo reflects on what he has accomplished so far, he envisions the work he and his colleagues have done as the tool that will help people get access to the medications they need.

“I know there will be a day that will come, and we’ll have the right solutions to consider these diseases,” he said.

#IAmScience: Dong Xu

April 8, 2022 By Penquite, Cara (MU-Student) in Research Tags: bioinformatics

Data connects all: ‘Champion collaborator’ Xu bridges research disciplines with bioinformatics

By Cara Penquite | Bond LSC

Dong Xu extracts wonder from numbers with a keyboard and eager teams of scientists at his fingertips.

With his salt-and-pepper hair visible above the cubicle walls and his voice softly but steadily articulated, the beauty of bioinformatics takes shape in his mind although it might not be inherently evident in the rows of computers tucked into a small first floor lab.

Xu weaves a multifaceted masterpiece of research methodologies and makes sense of a sea of data from cell biologists, plant scientists, engineers and many more with hundreds of publications to show for it.

“For research, collaborating is key because the nature of research requires many views, skills [and] knowledge,” Xu said.

A Bond Life Sciences Center researcher and a newly endowed Curators professor of bioinformatics in the School of Engineering, Xu harnesses data to better understand biological systems such as revealing a better picture of protein dynamics or analyzing genetic codes for individual cells. With advanced data interpretation and machine learning, the Xu lab opens opportunities for more in-depth research across Mizzou and the world.

With so much existing on a microscopic level in living creatures and new technologies to collect data continually evolving, massive amounts of information are extracted in biological research. Xu’s expertise is invaluable for interpreting all that information, but he takes it a step further by predicting how pieces of biological systems interact.

His recent efforts include using deep learning, a subset of artificial intelligence and machine learning, to understand how the shape of protein binding sites changes when molecules bind to other areas of the protein. This research, recently published in Nature Communications, represents one of many collaborations the Xu lab is known for, in this case with Jilin University in Changchun, China.

The Xu lab’s help is crucial to researchers whose specialties lie in biological systems rather than computer science. Bond LSC Director Walter Gassmann recognizes Xu’s niche.

“Biology has become so complex that you can’t be expert in everything,” Gassmann said. “The way to move forward is to collaborate with people, and that’s what Bond LSC is all about — bringing people with different disciplines under one roof, letting them rub shoulders and figuring out how best to solve a problem.”

With computational analysis being key to many research studies, the Xu lab collaborates with various labs at the research center.

“Dong is the champion collaborator [with] so many connections in the center,” Gassmann said.

Xu makes note of the particular importance of working together when mentoring young researchers. Students in the Xu lab work in teams and alongside biologists, allowing the researchers to learn from one another.

“There is actually an African saying, ‘it takes a village to raise a child,’” Xu said. “The same is true for mentoring students. It really takes many people to mentor a student. That includes not only community members and collaborators, but also peers.”

Xu fosters that shared work ethic by giving credit where it is due when it comes to research. While other institutions sometimes only give the lead investigator full credit for a project, Mizzou recognizes the work put in by each collaborator for any given project.

“MU is a very collaborative environment,” Xu said. “It really supports interdisciplinary research, [and] I don’t take it for granted. Interdisciplinary research usually relies on administrative support.”

Bioinformatics was more of an afterthought to Xu despite its prominence in his life now. His studies started with bachelor’s and master’s degrees in physics, and it was not until he started Ph.D. studies that his focus shifted.

“I was working on the biophysics of saturated proteins, and that’s a computational analysis,” Xu said. “I became very interested in computational work, so since then I’ve been working on this computational biology, or bioinformatics, for about 30 years.”

Xu’s drive to know all things data is revealed in his excitement to talk science.

With a small chuckle he noted that research does not usually bring money and fame. While he could use his computer skills to gain such rewards working for tech companies, he chooses to pursue science instead.

“To be a scientist requires passion,” Xu said. “You really need passion and to believe in the impact, the value of research. I do feel a reward in that regard.”

The Xu lab develops machine learning, a subset of artificial intelligence that involves teaching computers to mimic human intelligence. That starts with developing technology to collect data and facilitate its analysis.

This stretches into deep learning, more advanced machine learning that involves layers of neural networks. The input information is organized through these networks as the computer makes sense of the information.

People encounter deep learning in their day-to-day lives. The algorithm that sorts photos by faces in your phone’s photo app — that’s deep learning.

“It can start from these pixels and then can retrieve information like eyes [and] nose, and then can [find] a match,” Xu said. “It’s not only deep layers, but also the complex architecture of the network, so that’s why it’s called deep learning.”

A similar process is used in the Xu lab’s work. Rather than sort pixels to recognize faces, Xu’s work sorts data — like amino acid sequences — to predict protein interactions.

By finding trends in large sets of data, deep learning can speed up existing processes.

Mark Hannink, a Bond LSC principal investigator, recently saw this in action. Xu developed techniques to read scientific articles and generate diagrams of protein pathways described in the database. While people can read and summarize these articles, the process takes time, so even the latest diagrammed pathways end up outdated.

With deep learning, each time a new article is published, the diagram could automatically update.

Working closely with Xu, Hannink saw his mentoring approach in action.

“What I’ve been impressed with is how good of a mentor he is to the students working on this project,” Hannink said.

While high-impact research is one goal, Xu also takes care to develop scientific minds.

“Not only [do] we produce papers [and] software, we also really need to produce high-quality, next-generation researchers,” Xu said. “So, our goal is to mentor people to be successful, and many of our lab members are really on the right track to be successful.”

Pancreatic tumor composition provides insight on treatment response

February 17, 2022 By Penquite, Cara (MU-Student) in Research Tags: bioinformatics, cancer, gene expression, mice

Jing Zhou focuses the microscope through her computer. The microscope feeds its view directly to her screen so Zhou can see the pancreatic cells. | Photo by Cara Penquite, Bond LSC

By Cara Penquite | Bond LSC

Not all tumors are created equal, and potential treatments aren’t universal. When it comes to pancreatic cancer, surgery and radiation often do more harm than good due to its rapid growth and ability to spread to the liver.

Searching for alternative treatment options, Jing Zhou focused her research on immunotherapies. That led Zhou and her team to identify the sequences for over 9,000 cells from pancreatic tumors to figure out why some therapies don’t measure up.

“My professor said, ‘You have to do a lot of work to try and make sure your sample is ready [and] your experiments are ready. Otherwise, we [will] spend a lot of time, a lot of money, on [research] that doesn’t work,’” said Zhou, a NextGen graduate student who collaborated closely with Trupti Joshi’s bioinformatics lab at the Bond Life Sciences Center for the data analysis.

Jing Zhou grabs a box with pancreatic tumor tissue samples in cover glass. Zhou can see the cancerous and normal cells by magnifying the samples in a microscope. | Photo by Cara Penquite, Bond LSC

Zhou identified differences between types of pancreatic tumors to understand why certain types of pancreatic cancer respond differently to treatment. Scientists analyzed RNA sequences for two different types of pancreatic tumors. While the project is continually evolving, her most recent findings were published online in the Journal of Translational Oncology.

Zhou focuses on a form of immunotherapy that increases the effectiveness of the immune system. Known as immune checkpoint inhibitors, the treatment turns off protein pathways that act as brakes on the immune system. In cancer patients, turning off the brakes allows immune cells, like T-cells, to attack more cancer cells.

“The cancer cells grow so fast, they expand a lot and that attracts a lot of T-cells. Normally, in a healthy tissue they don’t have so many T-cells, just a few of them to keep the body normal,” Zhou said. “But when you have a strong immune response, like in a tumor, it will have a lot of T-cells in that kind of environment.”

Not all types of pancreatic tumors respond well to immunotherapy.

“This was a very interesting project because what they were doing here is comparing four different types, [but] mainly two different types, of pancreatic cancers,” said Joshi, Bond LSC principal investigator and Assistant Professor at the Department of Health Management and Informatics in the School of Medicine and core faculty in MU’s Data Science and Informatics Institute. “One that is resistant to and one that is sensitive to this anti-PD-1 antibody treatment, what is known commonly as immunotherapy.”

Jing Zhou selects a slide containing human pancreatic tumor cells to put into the microscope. While Zhou’s previous work was conducted in mice, surgeons now send human tissue samples to the lab for her to analyze as well. | Photo by Cara Penquite, Bond LSC

Tumors are composed of many types of immune cells mixed in with cancerous cells, and Joshi’s lab analyzed the composition to understand why two different types of pancreatic cancer react differently to treatment.

Unlike past analyses that relied on analyzing the tumors as a whole, the scientists used a process called single-cell RNA sequencing which allowed them to identify the various types of cells present and quantify the gene expression in individual cells. This quantification allowed the researchers to see which genes were turned on in each cell.

“The single-cell technology is the biggest advancement in sequencing technologies,” said Yuexu Jiang, a postdoc in Joshi’s lab who analyzed the single-cell RNA sequencing for the project.

Joshi suggested the analogy of a fruit smoothie to understand the concept of single-cell RNA sequencing. A fruit smoothie is a mixture of many fruits, and to understand what causes the flavor of the smoothie you must know how much there is of each fruit.

Single-cell RNA sequencing allows the researchers to quantify what types of cells are in the tumor, as well as identify the genetic makeup of the cells in the tumor.
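
To make the smoothie analogy concrete, the toy sketch below contrasts a bulk average (one number per gene for the whole sample) with a per-cell-type breakdown of the same data. The cell labels, gene names and counts are invented for illustration and do not come from the study.

```python
from collections import Counter, defaultdict

# Made-up cells: (cell type, {gene: count}). Names and numbers are illustrative only.
CELLS = [
    ("T cell", {"GENE_X": 10, "GENE_Y": 0}),
    ("T cell", {"GENE_X": 8, "GENE_Y": 2}),
    ("tumor cell", {"GENE_X": 0, "GENE_Y": 6}),
    ("tumor cell", {"GENE_X": 2, "GENE_Y": 4}),
]

def bulk_average(cells, gene):
    """Bulk view: one number per gene, averaged over every cell (the smoothie)."""
    return sum(counts[gene] for _, counts in cells) / len(cells)

def composition(cells):
    """Single-cell view, part 1: how many cells of each type are present."""
    return Counter(cell_type for cell_type, _ in cells)

def average_by_type(cells, gene):
    """Single-cell view, part 2: mean expression of a gene within each cell type."""
    totals, n = defaultdict(float), Counter()
    for cell_type, counts in cells:
        totals[cell_type] += counts[gene]
        n[cell_type] += 1
    return {t: totals[t] / n[t] for t in totals}

print(bulk_average(CELLS, "GENE_X"))     # one blended value for the whole sample
print(composition(CELLS))                # how much of each "fruit" is present
print(average_by_type(CELLS, "GENE_X"))  # which cell type actually expresses the gene
```

The bulk number hides the fact that, in this invented sample, nearly all of the GENE_X signal comes from one cell type, which is the kind of detail the per-cell view recovers.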

“That really allows you to pinpoint that it is this particular gene expression in this particular type of cell which is really associated with what outcome you’re seeing in terms of these different tumors’ behavior,” Joshi said.

Pancreatic cancer is difficult to treat since surgery and radiation often do more harm than good. With its rapid growth rate and ability to easily spread throughout the body, a pancreatic cancer diagnosis comes with a 50% five-year survival rate, according to the American Cancer Society.

“I have been involved a lot of times in the surgery. I saw when the surgeon opened the [patient], and when they saw the metastasis, tiny things, on the liver, they just closed the [patient]. They won’t remove the cancer anymore,” Zhou said.

Immunotherapy provides another option for patients that need one.

“Getting a better understanding of the composition and what parts of the immune system are associated or involved in times when you see a cancer responding to immunotherapy […] that is really what this research was targeted towards understanding,” Joshi said.

This research was published in the Journal of Translational Oncology on November 9, 2021, under the title “Single-cell RNA sequencing to characterize the response of pancreatic cancer to anti-PD-1 immunotherapy.”

Technique connects DNA instructions to biological architecture in space: Core collaboration maps the future

November 10, 2021 By Lauren Hines in Research Tags: bioinformatics, Genomics Technology Core, Spatial Sequencing

By Lauren Hines | Bond LSC

The brain is a unique challenge. It has billions of cells with billions of different functions, making it hard to understand what is going on underneath. MU cores now offer 10X Genomics Visium technology which allows researchers to lay genetic data over an image of a tissue sample creating a map of the brain at a genetic level — or any other organ you can think of.

“Spatial transcriptomics is a great example of how science has become more collaborative in nature, certainly, and that requires cross-disciplinary interaction between researchers and other technologies,” said Nathan Bivens, director of the Genomics Technology Core at Bond LSC.

Teamed up with the Advanced Light Microscopy Core and Bioinformatics and Analytics Core, Bivens’s core played a key part in developing spatial sequencing on campus. The technique maps physical characteristics of any tissue to its corresponding gene expression, so researchers can understand what’s going on at the genetic level.

The collaboration started last year when researchers across the country reached out to the individual directors at Bond LSC. Each could handle their part of the project but realized they couldn’t do it alone.

“There are very few places that have all three of those groups, especially in one building where they work so closely together,” said Lyndon Coghill, director of the Bioinformatics and Analytics Core.

Spatial sequencing gives scientists a way to point at a tissue, say which gene is being expressed where and identify cell types. Researchers can understand how these cells are interacting with each other and what that means for the function of the organ overall.

“We can actually see which part of that tissue those differences are occurring in, which can dramatically increase our biological understanding of what’s happening because certain types of tissues are known to do certain things biologically. By connecting those pieces, it’s sort of like adding that final puzzle piece to getting a full picture of what’s happening between these things,” Coghill said.

Before spatial sequencing, there were a few ways to analyze the genes in tissues, but no real way to see what traits those genes were connected to or how these genes were interacting with each other.

While tissues have different cell layers and a lot going on, researchers would have to mix that all up kind of like using a blender. Take a simple leaf for example. It would be ground up with mortar and pestle then put in a solution to be sequenced. By analyzing the mixture, they could see all the genes together, but lose definition and cell-specific expression of genes. Essentially, it created a genetic average of everything, and you couldn’t see how genes were expressed closer to the stem versus farther away or in a specific structure.

“Spatial transcriptomics is just the right approach to study gene expression inside diverse cell populations like the brain or small brain regions,” said Alexander Jurkevich, associate director of the Advanced Light Microscopy Core. “It really opens great perspectives for studying, knowing and learning new functions.”

The key to this technology is having equipment and expertise for imaging, DNA analysis and data organization all under one roof.

The process starts in the Advanced Light Microscopy Core, where Jurkevich thinly slices frozen tissue, puts these slices onto very special slides and stains them with a dye so they show up under the microscope. After he takes some pictures, the slides are sent off to Bivens in the Genomics Technology Core.

Core staff then enzymatically removes the cell membranes of the tissue to release genetic material. Remember those special slides? Even though the tissue is now broken down, those slides have dots underneath the tissue that capture the expressed genes and assign a unique barcode to each point of the tissue. That way, researchers can analyze the data but still know which gene goes where.
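
The barcode idea can be sketched in a few lines: if every capture spot has a known position and every sequenced read carries the barcode of the spot that caught it, reads can be grouped back into a map of the tissue. The barcodes, coordinates and gene names below are hypothetical, and the real Visium data format and software are not shown.

```python
from collections import defaultdict

# Hypothetical barcode -> (row, column) positions of capture spots on the slide.
SPOT_POSITIONS = {"AAAC": (0, 0), "CCGT": (0, 1), "GGTA": (1, 0)}

# Hypothetical sequenced reads: each carries the barcode of the spot that captured it.
READS = [("AAAC", "GENE_X"), ("AAAC", "GENE_X"), ("CCGT", "GENE_Y"),
         ("GGTA", "GENE_X"), ("TTTT", "GENE_Y")]  # "TTTT" matches no known spot

def build_expression_map(reads, positions):
    """Count reads per gene at each slide position, skipping unknown barcodes."""
    expression = defaultdict(lambda: defaultdict(int))
    for barcode, gene in reads:
        if barcode in positions:
            expression[positions[barcode]][gene] += 1
    return {pos: dict(genes) for pos, genes in expression.items()}

for position, genes in sorted(build_expression_map(READS, SPOT_POSITIONS).items()):
    print(position, genes)
```

Because the positions come from the slide layout rather than from the tissue itself, the spatial information survives even after the tissue has been broken down.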

Finally, all this data, from the DNA analysis, the images and the barcodes, goes to Coghill in the Bioinformatics and Analytics Core. Coghill and his team work to organize the data and create a map of the tissue and its genetics.

Jurkevich, Bivens and Coghill are looking towards seeing this technology map tumor treatment, Alzheimer’s in the brain and other disease studies.

“I really do think this connection of the raw data with a biological architecture is where we’re going to make those kinds of leaps forward not just in discovery, but also in potential clinical applications,” Coghill said.

Now, the cores have finished their first attempt at this huge collaboration. So far, it’s been successful, but they want to streamline the process. The biggest hurdle has been finding times when all three cores are available to work on a new project.

“Working on this technology was a great opportunity to work with different research cores … for us to learn something new and generate a product which is unique and very complex,” Jurkevich said.

Spatially resolved transcriptomics technology was named Method of the Year 2020 by the journal Nature Methods.

#IAmScience Li Su

October 15, 2021 By Lauren Hines in #IAmScience Tags: bioinformatics

By Lauren Hines | Bond LSC

The work was tiring. The hours were long. However, Ph.D. candidate Li Su wasn’t affected by any of it. She was in her element.

During her undergraduate degree in China, Su studied turfgrass science.

“There was a chance for undergraduates to do some research project, so I tried it and, although it was exhausting, I stayed in the lab and time just passed,” Su said. “I felt quiet and at peace. I kind of enjoyed it.”

As part of the Dong Xu lab at Bond Life Sciences Center, Su works on statistics and data analysis for many research studies throughout Bond LSC.

Originally from China, Su moved to Springfield, Missouri in 2016 to earn her master’s in plant science at Missouri State University. Once she graduated in 2018, she moved to Houston, Texas to work at a biomedical research institution. After a while, she applied for graduate school but wanted to go in a new direction.

“While I was in Houston, at that job, I was confused,” Su said. “I was just thinking about my skills, what I liked to do in the lab and what will make me survive … I realized even a lot of postdocs or senior graduate students were kind of limited in the statistics and data analysis, so I tried to figure out how to do those things.”

Su switched her focus, was accepted by Mizzou in 2020 and soon found her place in the Dong Xu lab.

“As we are trying to handle this big data, the main weapon for us is coding,” said Juexin Wang, Dong Xu’s lab manager. “So, when we are trying to deal with that big amount of data, we have to highly rely on the coding skills and [Su] does that very, very well. She is learning fast and uses all her resources to learn that.”

Su joined the lab while it was strictly Zoom lab meetings and everything was remote. Despite the digital barriers, Su stood out to Wang.

He had found a paper whose methodology he believed the lab could replace with its own to make the study stronger. Wang mentioned this to Su over Zoom, not thinking much of it.

“Probably weeks later, she came to me and she tells me many things about the other methodology,” Wang said. “So, I was really impressed.”

Su understands what it means to do good science in the lab and what that could mean for others.

“I think a lot of people I work with tell me to be honest with yourself about your science, about your work,” Su said. “I want some work to be like this, so you have a novel idea, you scientifically prove it and make the conclusion helpful to a group of people. I feel like if I have such work, I can be part of the [scientific] community.”

Even though Su isn’t working on any of her own projects right now, her main goal is to publish new and better papers during her Ph.D.

“Smarts, diligence, persistence — I think those are very, very key characteristics,” Wang said. “[Su] is making her weapons much more powerful and much sharper. I think she will get some very good achievements.”

#IAmScience Lyndon Coghill

June 24, 2021 By Suppes, Davis (MU-Student) in #IAmScience Tags: bioinformatics, Mizzou Research

Lyndon Coghill is the new Director of the Bioinformatics and Analytics Core, and he is already making big moves at Mizzou.

By Davis Suppes | Bond LSC

Lyndon Coghill’s official title may be Director of Informatics for the Bioinformatics and Analytics Core, but his job branches out much wider than just a single label. Even as an undergrad, Coghill wore many different hats.

“I was incredibly excited about the way that the MU Office of Research and Economic Development recruited me,” Coghill said. “With these types of processes you can get an idea as to whether or not an institution is actually committed and excited about building something out.”

With his experience and range of expertise, Coghill was an easy choice for Mizzou to fill the role of Director of the Bioinformatics and Analytics Core located at Bond Life Sciences Center.

Before he achieved his doctorate in biology, he completed his undergraduate degree in zoology with minors in microbiology and geology at Western Illinois University. For his dissertation, the research he conducted was focused heavily on evolutionary genomics. Simply put, he wanted to know how changes in the genome lead to changes in a physical organism that allow them to adapt better to different environments and conditions.

With his doctorate in biology, he would go on to his first postdoc at The Field Museum of Natural History in Chicago in 2013, and then on to Louisiana State University, where he began his role as a senior postdoc in 2015. He continued to diversify his portfolio there, working with the department of biology and focusing on computational biology. He was then promoted to research data scientist, which had him take on an even more computationally heavy role. With this, he was able to help biologists learn how to talk to computer scientists and assist them with building collaborative programs together.

As director, Coghill’s mission is to provide bioinformatics and data science support to all researchers across the UM system. He is creating a central hub where faculty who want to conduct domain-specific biological or life sciences-related research that is computationally heavy can get the help they need to come up with solutions. He does this by helping researchers wrangle incredibly large datasets and by helping them understand what that data is telling them from an information perspective in a meaningful way.

Coghill mentioned how interim Vice Chancellor of Research and Economic Development Thomas Spencer also made a personal effort to make sure Coghill understood his vision going forward on campus “and for me that was enough of a selling point that I wanted to be a part of that,” Coghill said.

In addition to the thorough recruitment process, Mizzou’s facilities and access were other huge factors that Coghill was looking forward to once he got here. With a hospital, vet school and productive biology program all on the same campus instead of in different cities, Mizzou offers a unique opportunity to build the integrated program all in one place.

“We’re trying to reach out to every department on campus to build these relationships because you can’t have true integration of ideas and solutions if you don’t talk to everyone who might be a benefactor or have knowledge about that,” Coghill said.

Coghill believes that to create a phenomenal translational research program, this core must interact with all these programs so that experts of different fields can come together to collaborate.

“Informatics research, especially bioinformatics, is a program that really forces you to keep one foot in both worlds of computer science and biology, and there’s a limited number of people who do that kind of work,” Coghill said. “I think that was one of the big pushes for getting my experience here for this position, to bring in somebody who could bring these programs together and integrate across all these different fields.”

Coghill is excited to be working with the variety of researchers and programs across the MU campus and UM System, and learning from them at the same time.

“We may not know their biological system as well as they do and we may not know the high performance computing system as well as a full-time systems administrator, but we know enough of both that we can communicate with both teams and make sure that we can help get the researchers from the starting point to a meaningful result,” Coghill said.

Their goal is to provide Mizzou and sister campuses with research support allowing faculty to build translational research programs using computing power and informatics. This core brings new opportunities for Mizzou students as well.

“We’re going to have programs for students that can rotate through as part of the Informatics and Data Science Institute,” Coghill said.

This means that students who are interested in research fields can get direct experience related to career possibilities outside of Mizzou and academics by working in this program.

“Students can come to us and learn basic coding skills, learn informatics and bioinformatics, and that will help them build a skillset that will make them quite employable,” Coghill said.

Between helping researchers in their labs and analyzing quantities of data they are gathering for the first time, Coghill has a variety of jobs he has to understand and execute.

“So, I am the guy who wears a lot of different hats and allows these researchers from different domains to talk to each other,” Coghill said. “We’re trying to help them get to the point where their work could be as big as they want.”

Mizzou and Coghill know that there is no way to push modern research without computing, especially at the scale research is done today.

It would be extremely rare for someone who has a doctorate and spent their life trying to understand how one particular part of a biological process works to also have a doctorate in computer science. “That’s where we help… we’re providing researchers with the tools to do research at a scale using computing power, and asking questions that for many, might have only been dreamed about at other times in their careers,” Coghill said.

#IAmScience Shawn Thomas

March 5, 2021 By Lauren Hines in #IAmScience Tags: bioinformatics, Brassicales

By Lauren Hines | Bond LSC

Social media botany advocate and self-proclaimed coffee snob, Shawn Thomas is the kind of person to find joy in everything.

Thomas graduated from the University of Georgia in spring 2018 and worked as a bioinformatics technician for a year with Jim Leebens-Mack before joining Chris Pires’ lab at Bond Life Sciences Center as a Ph.D. student in fall 2019. Ever since, he’s been studying how genome duplications affect plant traits in a certain group of plants that includes broccoli, cauliflower and kale.

Genome duplications occur when a plant’s DNA is copied multiple times, which can happen when two species hybridize. To study this, Thomas is looking at the wild distant relatives of these crops.

“Understanding these ancient genome duplications, how they happened and when they happened can give us an idea of how different evolutionary innovations occur,” Thomas said.

Thomas has been a part of the Pires lab for over a year now and has brought his own set of skills.

“The thing that I’ve been really impressed with him — well, two things — one is he’s very skilled with bioinformatics,” said Pires, Bond LSC principal investigator. “He doesn’t have a computer science degree, but he’s got math skills and coding and a couple of different computer languages for data wrangling genomes. But the other thing is he’s made some really hilarious YouTube videos.”

Mixing his love of plant biology and lab inside jokes, Thomas makes clever YouTube videos for fun and even for grants.

One video shows what it’s really like inside the Pires lab, despite Pires’ expectations. It involves a well-known lab joke about Thomas prioritizing his undergraduate work while putting aside his Ph.D. thesis.

Another video served as a submission for the PACBIO 2020 Plant and Animal Sciences SMRT grant program. The video was about the need for a reference genome of Cakile (“Sea Rocket”) to enable new studies in polyploid evolution, plant migration and crop improvement. In Europe, arugula is known as “rocket,” and a closely related species to rocket is “Sea Rocket,” so Thomas didn’t miss the opportunity to have some fun with it.

Even outside of YouTube, Thomas is tech-savvy on many fronts.

“I like to play with computers, and I like genetics, so playing around with big data genomic data has been fun,” Thomas said. “The work that I do is more basic research, where we are trying to understand the fundamental concepts behind how plants work and how these different mechanisms work. For me, the curiosity of trying to understand how polyploidy works is what interests me.”

Thomas is active on Twitter talking about everything Brassica — the mustard family — to help get others engaged in the basic research of plant biology.

“When we go to a conference, I’m known for being too much of the Twitter guy, but he’s also in there tweeting all the talks,” Pires said. “So, he’s not afraid to be an advocate for botany. He clearly loves botany, and he wants to have an influence, not just a science influence but an educational outreach kind of influence.”

In the lab, Thomas often helps move things forward by reminding the group about journal clubs, reading papers and corralling the undergraduates through his love of coffee.

“I’m a bit of a coffee snob, so I like to experiment with coffee and try all the new different recipes and techniques,” Thomas said.

Before the pandemic, Thomas would regularly visit Shortwave Coffee and get a pour-over of Ethiopian coffee (he highly recommends it). Now, he makes his own each morning and talks with the undergraduates about coffee while distanced in the lab.

“He’s a good team player, and he’s very social,” Pires said.

While Thomas is still learning how to be a mentor and exploring the diversity of plants, he has even more ahead of him.

“I’m proud of what I’ve done so far,” Thomas said. “I definitely have a long way to go, but I find enjoyment out of being able to create a nice figure or graph or get a good result that works with the story that we’re trying to tell.”

#IAmScience Shuai Zeng

February 12, 2021 By Becca Wolf in #IAmScience Tags: artificial intelligence, bioinformatics

By Becca Wolf | Bond LSC

It’s not a straight line between basic research and Silicon Valley, but Shuai Zeng made the dots connect.

Last summer, Zeng, a Ph.D. candidate in computer science, had an internship at Google headquarters in Mountain View, California, where he worked on an applied research team. There, he helped design and develop a state-of-the-art deep learning model for video recommendations for Google Ads and YouTube.

Deep learning, a branch of artificial intelligence (AI), processes data with layered neural networks loosely modeled on the human brain. It is used in detecting objects, recognizing speech, translating languages, and making decisions. Examples include Amazon’s Alexa and the navigation systems of self-driving cars, such as Tesla’s.

“At Google, I felt like I was working at a college like Mizzou,” Zeng said. “Working there required me to learn a lot of new things in a very short time. I learned not only coding skills, but also creative, analytical, and research skills.”

Zeng currently works in the University of Missouri School of Medicine and Bond Life Sciences Center under Dr. Trupti Joshi’s guidance. He finds it beneficial to build a bridge between his knowledge of computer science and fields like medicine and plant biology.

“I currently work on the infrastructure for collecting and integrating multiomics data on soybean, maize, Arabidopsis, human, mouse, and many other organisms,” Zeng said. “The website provides a lot of interesting tools that allows the researchers to see, analyze, and store their data online. I can do the data analysis for them so they can go to the website and see the results from huge datasets and easily extract information that they need.”

Zeng also credits his time and experience at Bond LSC with helping him get the Google internship last summer.

“I work on a lot of different projects here, which is good because working with Google requires having a strong background,” Zeng said. “If I only work on the computer science part, it is not good enough because you need to learn a lot of other things as well.”

Zeng even enjoys combining computer science with biology, despite the difficulty.

“Fortunately, there are a lot of very good students here at Mizzou, so I don’t need to learn biology on my own. I just ask them some questions about it and then I can get to work,” Zeng said.

His advisor, Trupti Joshi, appreciates his willingness.

“He’s a very dedicated and motivated student,” Joshi said. “He does a fantastic job of translating and understanding the deep purpose of what the data issues are. He’s been one of our really crucial contributors in the lab.”

Zeng enjoys the environment at Bond LSC because of its similarities to Google. At Bond LSC, he works in the Joshi lab, where he aids in data analysis. One of the main things he works on is the Soybean Knowledge Base (SoyKB) and Knowledge Base Commons (KBCommons) frameworks.

“He really contributes to new developments, new methodology implementations, and developing and maintaining some of these crucial frameworks that we use for collaboration with faculty here and also outside,” Joshi said. “He’s going to have a fantastic career trajectory with all the experience he has gained from these research projects and internships.”

Zeng hopes that career involves going back to a large company like Google after he graduates.

“In the next five years, I would love to get a full-time job as a research scientist at Google or Facebook and be learning as many new things as possible,” Zeng said.

Until then, he will continue learning as much as he can in labs here at Mizzou.

BIPS: Bringing Plant Science and Engineering Together

February 3, 2021 By Becca Wolf in Research Tags: bioinformatics, BIPS, computer science, plant science

Nick Dietz and Marianne Slaten observing a plant in the lab. | photo by Becca Wolf, Bond LSC

By Becca Wolf | Bond LSC

Technological advances have always driven scientific discovery by making in-depth research possible, but that has never been more true than it is today.

“A couple of decades ago it was perfectly fine to be an engineer and a biologist and live in your own world,” David Mendoza said. “But as science has advanced, we depend more on mathematics and computer sciences now.”

Mendoza, principal investigator at Bond Life Sciences Center and associate professor of plant sciences, created a program to help develop those skills in the next generation of scientists.

Bioinformatics in Plant Sciences (BIPS) was born in 2016. The undergraduate program pairs plant science or biology majors with computer science or engineering majors. Through research, field trips, and journal clubs, undergraduate students learn how to collaborate on projects and how the two fields help each other.

While Mendoza is the PI who initiated BIPS through an NSF grant, students from all labs are welcome in this program. More recently, a second NSF grant in collaboration with Gary Stacey has allowed the program to expand. BIPS identifies labs that have projects with both computer and biology components for students to work on. Students can also come to BIPS with a project already in mind.

Graduate Student Mentors

BIPS is run by students, for students, with a handful of graduate student mentors and undergraduate students in the program.

The graduate student mentors run weekly meetings, invite guest speakers, and organize the journal clubs, where students review scientific publications and discuss them. Graduate mentors Marianne Slaten and Nick Dietz have enjoyed their time and responsibilities at BIPS.

“We make sure they’re on track and don’t get stuck. We want them to become well-rounded researchers,” Slaten said. “It’s a really novel opportunity to jumpstart the next generation of researchers.”

While BIPS cannot go on a field trip this year or have in-person speakers, Slaten and Dietz have come up with alternatives to keep undergrads engaged.

“We’ve been doing workshops showing different bioinformatics tools that can be used to address different research questions,” Dietz said. “We’re also having them do journal club, which gives them a better understanding of the literature surrounding what they are studying. We’re trying to normalize the experience as much as we can, even though everything’s virtual right now.”

A Bioinformatics in Plant Sciences (BIPS) meeting in progress. | photo courtesy of Nick Dietz, Bond LSC

Undergraduates

Undergrads are put into research teams and begin working.

Many projects start with a biology student cataloguing physical plant traits while a computer science or engineering student creates a way to take images of the plants and organize the data and information.

“Some of the files you work with are so big you can’t even open them in Excel, so there’s always room for computational people that really know how to harness all that data,” Slaten said. “The files are just so big that your computer crashes. It takes additional skills to know how to deal with that data.”
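One common way to handle files too large to open in a spreadsheet is to stream them row by row and keep only running totals in memory. The sketch below is an illustration of that general idea, not the BIPS teams' actual workflow; the column names `plant_id` and `leaf_area_mm2` and the sample data are hypothetical.

```python
import csv
import io
from collections import defaultdict


def mean_trait_by_plant(rows, id_col="plant_id", value_col="leaf_area_mm2"):
    """Average one trait per plant while reading rows one at a time.

    Only two small dictionaries of running totals are held in memory,
    so the input can be far larger than available RAM.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for row in rows:
        totals[row[id_col]] += float(row[value_col])
        counts[row[id_col]] += 1
    return {plant: totals[plant] / counts[plant] for plant in totals}


# Tiny in-memory stand-in for a large CSV file (hypothetical data).
sample = io.StringIO("plant_id,leaf_area_mm2\nA,10.0\nB,20.0\nA,14.0\n")
means = mean_trait_by_plant(csv.DictReader(sample))
print(means)
```

With a real file, the same function could be given `csv.DictReader(open(path, newline=""))`, since the reader yields one row at a time instead of loading the whole file.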

While a majority of their projects focus on phenotyping, BIPS is looking to branch out into other areas.

“A big problem in biology right now is big data, which you can’t get through that if you don’t have computer science,” said Maddy Creach, a junior from Walter Gassmann’s lab at Bond LSC. “It’s interesting to see the way computer scientists look at problems and that has definitely made me a better researcher, especially since I know 1,000% more about computer science than I did when I started.”

Creach has found that her two years in BIPS have helped her find new ways to think about research.

“A big thing that I have gotten out of BIPS is how to manage my own project,” Creach said. “It’s on me to communicate with my partner and get stuff done and put work towards it, because no one else is going to tell me to do it.”

In addition to working on their research and participating in journal club, the undergraduate students are expected to make a poster each year to present their research either at Life Sciences Week or at the Undergraduate Research Forum. Since the pandemic has limited or canceled these events, the students now submit a video of their work online for others to watch.

BIPS’ Impact

Building research and collaboration skills has helped everyone involved.

“I’m learning as much from them as I’m teaching them, so that’s been really awesome,” Slaten said.

Students also hone their communication skills.

“I didn’t realize being a good mentor is a skill set, it’s something you can cultivate over time,” Dietz said. “Oftentimes as students, we think of mentors as either good or bad, you either have it or you don’t. But it’s actually a skill that you can develop over time and I’ve picked that up since I’ve been helping run the program.”

But all remember it’s a learning process.

“Everyone’s really fun and easygoing, no one is judgmental because there is a range in expertise,” Creach said. “There are no dumb questions.”

BIPS has helped many students become well-rounded in their research abilities and is always looking for more students.

“The world is a big place that is moving at a fast pace, and if they don’t get on the train, they are going to miss it,” Mendoza said. “It’s that simple. There’s so much happening in real time, that if you don’t learn how to integrate technology into your research, you’re going to lag behind.”

For more information on BIPS or to apply, contact David Mendoza or go to Bioinformatics in Plants (BIPS).

Connecting the World Through the Cloud

December 3, 2020 By Becca Wolf in Research Tags: bioinformatics, COVID-19, high-throughput phenotyping

Maria Clare Lusardi, an undergraduate student in the Mendoza lab, uses the cloud, a program that is part of CyVerse. | photo by Becca Wolf, Bond LSC

By Becca Wolf | Bond LSC

Clouds come in many shapes and sizes. Some are big and fluffy, others dark and ominous.

Or, as in David Mendoza’s case, the cloud is a hub of experiment information.

Mendoza, an associate professor of plant sciences and scientist in Bond Life Sciences Center, recently joined CyVerse, a cyberinfrastructure system that allows his team and collaborators to see data in real time. CyVerse, funded by the National Science Foundation, allows life science researchers to share and store their research data in the cloud.

This sort of system has come in quite handy for Mendoza as the Covid-19 pandemic limited the number of people allowed in a lab last spring, forcing him to look for a better way to work remotely but still together.

“These difficulties make you more creative. Covid-19 is a serious, terrible thing, and I could tell my lab was frustrated because they wanted to continue working, but couldn’t because of the restrictions,” Mendoza said. “We needed this. We started implementing the cloud and the opportunities to keep working and collaborate continued and worked out nicely.”

Mendoza connected the cloud to the robots his lab uses to take photos of experiments. These photos are sent every other hour to the cloud, so Mendoza and collaborators can check in on experiments from anywhere in the world.

To get connected, Mendoza brought in Drew Dahlquist, an undergraduate student, to help.

“As a computer science student, the stuff I’m doing feels very natural, but I didn’t realize how much other areas of science could use it. Before I came on, they were storing all the images just on a laptop in the lab and sending them around on thumb drives, which can be very inefficient,” Dahlquist said. “It’s going to greatly improve what David’s doing in the lab.”

Dahlquist has been transferring Mendoza’s data from the thumb drives to the cloud, a process he plans to finish by the end of the semester.

“Hopefully, I can start tweaking it a little more to refine it and then just see what other needs pop up in the future,” Dahlquist said.

The Project

The main task benefitting from this new approach is a high-throughput phenotyping project, led by Landon Swartz, an undergraduate student in Mendoza’s lab.

“I was kicked off campus when we had quarantine last semester, so all of my projects stopped suddenly,” Swartz said. “I was just stuck at home doing remote stuff, which was not helpful. So that’s where Drew came in to do the cloud stuff. We realized that this is going to be a regular thing for the next few years and will likely happen again in the future.”

The cloud helps Swartz and Mendoza because instead of waiting a week to see how an experiment is doing, they receive frequent updates and can make changes to an experiment in real time instead of after the fact.

This is beneficial when they observe how leaf color changes when plants are over- or underexposed to nutrients like iron. Too little iron can cause chlorosis, in which the leaves turn yellow and die. The cloud allows them to see if chlorosis is occurring at a different rate than they expected. With this information, they can change the parameters and continue the experiment without any issues.

“We are establishing protocols where first thing in the morning the machine is going to send me an email with the first photo of the day and to say that the robots are okay. Then, at night before bed, I’ll get the last picture of the day,” Mendoza said.
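The selection step in a protocol like this, picking the first and last photo of each day, can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the lab's actual software: the `(timestamp, filename)` records and the file names are hypothetical, and the step that would send the email is left out.

```python
from datetime import datetime


def first_and_last_per_day(photos):
    """Map each calendar day to its (first, last) photo name.

    `photos` is an iterable of (timestamp, filename) pairs in any order.
    """
    by_day = {}
    for taken_at, name in sorted(photos):
        day = taken_at.date()
        # Keep the earliest name already stored; overwrite the latest one.
        first, _ = by_day.get(day, (name, name))
        by_day[day] = (first, name)
    return by_day


# Hypothetical capture log: a photo every other hour would look similar.
log = [
    (datetime(2020, 12, 1, 22, 0), "d1_2200.png"),
    (datetime(2020, 12, 1, 6, 0), "d1_0600.png"),
    (datetime(2020, 12, 1, 8, 0), "d1_0800.png"),
    (datetime(2020, 12, 2, 6, 0), "d2_0600.png"),
]
summary = first_and_last_per_day(log)
for day, (first, last) in summary.items():
    print(day, first, last)
```

A day with a single photo reports that photo as both first and last, which would itself be a useful signal that the robot stopped capturing.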

The camera used to take photos of experiments in the Mendoza lab. These photos are then sent to the cloud. | photo by Becca Wolf, Bond LSC

Collaboration Made Easy

In addition to making experiments more accessible for the Mendoza lab, the cloud allows the lab to easily share data with collaborators.

“We started to realize that having our phenotyping robots be controlled remotely or be able to put our data onto the cloud would be super useful, not just for our lab, but also for labs around the world,” Swartz said. “So, maybe the U.S. goes into quarantine, but we have a sister lab in South Africa that is able to pick up our data where we left off and continue to run experiments for us.”

Having multiple labs look at the data at the same time helps create more ideas and exposes more people to those ideas.

“Our goal is to put the data out there and let more people try different approaches to analyze the data so that all of us as a field, can decide what is the best way to do standardized experiments,” Mendoza said.

Also, with the sister lab on another continent, there are people who can check in on the experiments around the clock.

“For instance, the lab in South Africa can see the data in real time. They can see if we need to change the light, increase the light, or make the night longer, and they could also control the nutrients among other things,” Mendoza said.

The cloud will only help experiments in the future.

“The internet is a great resource to collaborate. It’s not often used like that, but we can put data that we’re getting in the moment into this huge database that anyone across the world can look at. It really changes the whole game of science,” Swartz said. “It’s no longer about who can publish things first, it has really become an everyone’s in it together kind of thing. And that’s where I really would like to see this go, and especially in the next 10 or 20 years.”

While it started because of a pandemic, Mendoza sees it as a useful tool regardless of lab restrictions.

“Now that we’ve seen the possibilities, I don’t think we’re going back just because of the amount of experiments that you can do,” Mendoza said. “We actually plan on building more machines with more sensors, so it’s not going away anytime soon.”
