๐ Hello! I'm Sam, a Full Stack Data Scientist with a background in Chemical Engineering, currently pursuing an M.Sc. in Polymer Science at Freie Universitรคt and Humboldt Universitรคt Berlin. I'm passionate about solving complex problems at the intersection of data science, explainable AI and materials science.
- ๐ฌ Full Stack Data Scientist with experience in NLP, medical entity extraction, and patient-study profile matching
- ๐ Currently pursuing M.Sc. in Polymer Science at FU and HU Berlin
- ๐จโ๐ซ Regular corporate trainer in Generative AI, Causal Discovery and Inference, Linear Algebra, and Machine Learning
- ๐ป Proficient in Python, with expertise in Pandas, NumPy, Langchain, and FastAPI
- ๐งฎ Strong background in mathematical modeling of physical systems and optimization
- ๐ค Experience with Large Language Models (RAG and Agent)
- ๐ Multilingual: Fluent in English, Hindi, Malayalam; Proficient in German, Tamil, and Telugu; still dabbling with French and Spanish!
I'm currently working on exciting projects that combine my expertise in data science with my studies in Polymer Science:
- ๐งช Applying machine learning techniques to index and search polymer material properties
- ๐ Developing recommendation engines for polymer materials based on required properties
- ๐ Integrating knowledge from thermodynamics, chemistry, and physics to create comprehensive models
- ๐ Utilizing data-driven approaches to accelerate materials discovery and optimization
- ๐ Exploring Causal Discovery and Inference techniques in materials science and beyond
This interdisciplinary research aims to bridge the gap between traditional polymer science, causal inference, and cutting-edge machine learning techniques, potentially revolutionizing how we design and select materials for specific applications.
- Python (Pandas, NumPy, Langchain, FastAPI)
- Mathematical modeling and optimization
- Large Language Models (RAG and Agent)
- Natural Language Processing
- Azure DevOps and AWS
- Data reconciliation and process optimization
- Image processing and deep learning
- Polymer science and characterization
- Causal Discovery and Inference
Here are some projects I've worked on:
-
Polymer Property Predictor
- Machine learning model to predict polymer properties based on chemical structure
-
Medical Entity Extraction
- NLP project for extracting medical entities from clinical texts
-
Patient-Study Profile Matching
- AI-powered system to match patient profiles with suitable clinical studies
-
Water Network Management Optimization
- Large-scale integer optimization for efficient water network scheduling
-
Blast Furnace Data Reconciliation
- Data reconciliation project for improving blast furnace efficiency
- ๐ผ LinkedIn: linkedin.com/in/sammathewai
- ๐ GitHub: github.com/samiit
- ๐ I'm deeply interested in causal inference and its applications in data science. Judea Pearl's "The Book of Why" has been a significant influence on my thinking in this area.
- ๐ง I love exploring the intersection of machine learning, causal inference, and materials science.
- ๐ My reading interests span history, philosophy, technology, and scientific advancements.
- ๐ง In my free time, you can find me hiking or cycling.
- ๐ I've lived and studied in India, Germany, and the Netherlands.
- ๐งฌ I'm fascinated by the potential of combining materials science with machine learning and causal inference to solve real-world problems.
Feel free to explore my repositories and don't hesitate to reach out if you'd like to collaborate on a project, discuss the exciting world of polymer science and machine learning, or explore the depths of causal inference!
๐ Selected Publications and Recognition of Contributions
-
Sujan Hazra, Prakash Abhale, Sam Mathew and Shankar Narasimhan, "Application of data reconciliation and gross error detection techniques to enhance reliability and consistency of the blast furnace process data", Asia-Pacific Journal of Chemical Engineering, 2021
-
Pallab Sinha Mahapatra and Sam Mathew, "Activity-induced mixing and phase transitions of self-propelled swimmers", Phys. Rev. E, 2019, Vol. 99, 012609
-
Pallab Sinha Mahapatra, Ajinkya Kulkarni, Sam Mathew, Mahesh V. Panchagnula and Srikanth Vedantam, "Transitions between multiple dynamical states in a confined dense active-particle system", Phys. Rev. E, 2017, Vol. 95, 062610
-
Pallab Sinha Mahapatra, Sam Mathew, Mahesh V. Panchagnula, Srikanth Vedantam, "Effect of size distribution on mixing of a polydisperse wet granular material in a belt-driven enclosure", Granular Matter, 2016, Vol. 18, 30
-
Pramode K Das, Sam Mathew, A J Shaiju and B S V Patnaik, "Energetically efficient proportional-integral-differential (PID) control of wake vortices behind a circular cylinder", Fluid Dynamics Research, 2015, Vol. 48, 015510
-
Sam Mathew, B S V Patnaik and T John Tharakan, "Numerical study of air-core vortex dynamics during liquid draining from cylindrical tanks", Fluid Dynamics Research, 2014, Vol. 46, 025505
-
Sam Mathew, Ganesh Visavale and Vijay Mali, "CFD Analysis of a Heat Collector Element in a Solar Parabolic Trough Collector", International Conference on Applications of Renewable and Sustainable Energy for Industry and Society, Hyderabad (REIS-2010), 2010
-
Sam Mathew, Ganesh Visavale and Vijay Mali, "Making order in the cabinet : Integrating CFD in the green energy design process for food industry helps identify and fix causes for uneven drying in a Solar Cabinet Dryer", Ansys Users Conference, Bangalore, 2010
-
Raja Gopal Rayavarapu, Wilma Petersen, Constantin Ungureanu, Janine N. Post, Ton G. van Leeuwen, and Srirang Manohar, "Synthesis and Bioconjugation of Gold Nanoparticles as Potential Molecular Probes for Light-Based Imaging Techniques", Int. J. of Biomedical Imaging, 2007, 2007:29817