Hello! I'm pretty new to data engineering; my only experience is the workflows I put together myself for my thesis project. I need to understand it better, though, because I want to break into computational biology, specifically the structure/function relationship of biomolecules and the biophysics of drug-protein interactions. ML techniques are driving a huge revolution in computational modeling in this field, and it's hard to keep up and to work out which products and workflows are best suited to a given data processing and analysis problem.