Skip to content
View samiit's full-sized avatar

Block or report samiit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
samiit/README.md

Sam Mathew

๐Ÿ‘‹ Hello! I'm Sam, a Full Stack Data Scientist with a background in Chemical Engineering, currently pursuing an M.Sc. in Polymer Science at Freie Universitรคt and Humboldt Universitรคt Berlin. I'm passionate about solving complex problems at the intersection of data science, explainable AI and materials science.

๐Ÿš€ About Me

  • ๐Ÿ”ฌ Full Stack Data Scientist with experience in NLP, medical entity extraction, and patient-study profile matching
  • ๐ŸŽ“ Currently pursuing M.Sc. in Polymer Science at FU and HU Berlin
  • ๐Ÿ‘จโ€๐Ÿซ Regular corporate trainer in Generative AI, Causal Discovery and Inference, Linear Algebra, and Machine Learning
  • ๐Ÿ’ป Proficient in Python, with expertise in Pandas, NumPy, Langchain, and FastAPI
  • ๐Ÿงฎ Strong background in mathematical modeling of physical systems and optimization
  • ๐Ÿค– Experience with Large Language Models (RAG and Agent)
  • ๐ŸŒ Multilingual: Fluent in English, Hindi, Malayalam; Proficient in German, Tamil, and Telugu; still dabbling with French and Spanish!

๐ŸŽฏ Current Focus

I'm currently working on exciting projects that combine my expertise in data science with my studies in Polymer Science:

  • ๐Ÿงช Applying machine learning techniques to index and search polymer material properties
  • ๐Ÿ” Developing recommendation engines for polymer materials based on required properties
  • ๐Ÿ”— Integrating knowledge from thermodynamics, chemistry, and physics to create comprehensive models
  • ๐Ÿ“Š Utilizing data-driven approaches to accelerate materials discovery and optimization
  • ๐Ÿ” Exploring Causal Discovery and Inference techniques in materials science and beyond

This interdisciplinary research aims to bridge the gap between traditional polymer science, causal inference, and cutting-edge machine learning techniques, potentially revolutionizing how we design and select materials for specific applications.

๐Ÿ› ๏ธ Skills

  • Python (Pandas, NumPy, Langchain, FastAPI)
  • Mathematical modeling and optimization
  • Large Language Models (RAG and Agent)
  • Natural Language Processing
  • Azure DevOps and AWS
  • Data reconciliation and process optimization
  • Image processing and deep learning
  • Polymer science and characterization
  • Causal Discovery and Inference

๐Ÿ”— Projects

Here are some projects I've worked on:

  1. Polymer Property Predictor

    • Machine learning model to predict polymer properties based on chemical structure
  2. Medical Entity Extraction

    • NLP project for extracting medical entities from clinical texts
  3. Patient-Study Profile Matching

    • AI-powered system to match patient profiles with suitable clinical studies
  4. Water Network Management Optimization

    • Large-scale integer optimization for efficient water network scheduling
  5. Blast Furnace Data Reconciliation

    • Data reconciliation project for improving blast furnace efficiency

๐Ÿ“ซ How to reach me

๐ŸŒŸ Interests and Fun Facts

  • ๐Ÿ“š I'm deeply interested in causal inference and its applications in data science. Judea Pearl's "The Book of Why" has been a significant influence on my thinking in this area.
  • ๐Ÿง  I love exploring the intersection of machine learning, causal inference, and materials science.
  • ๐Ÿ“– My reading interests span history, philosophy, technology, and scientific advancements.
  • ๐Ÿง— In my free time, you can find me hiking or cycling.
  • ๐ŸŒ I've lived and studied in India, Germany, and the Netherlands.
  • ๐Ÿงฌ I'm fascinated by the potential of combining materials science with machine learning and causal inference to solve real-world problems.

Feel free to explore my repositories and don't hesitate to reach out if you'd like to collaborate on a project, discuss the exciting world of polymer science and machine learning, or explore the depths of causal inference!

๐Ÿ“š Selected Publications and Recognition of Contributions
  1. Sujan Hazra, Prakash Abhale, Sam Mathew and Shankar Narasimhan, "Application of data reconciliation and gross error detection techniques to enhance reliability and consistency of the blast furnace process data", Asia-Pacific Journal of Chemical Engineering, 2021

  2. Pallab Sinha Mahapatra and Sam Mathew, "Activity-induced mixing and phase transitions of self-propelled swimmers", Phys. Rev. E, 2019, Vol. 99, 012609

  3. Pallab Sinha Mahapatra, Ajinkya Kulkarni, Sam Mathew, Mahesh V. Panchagnula and Srikanth Vedantam, "Transitions between multiple dynamical states in a confined dense active-particle system", Phys. Rev. E, 2017, Vol. 95, 062610

  4. Pallab Sinha Mahapatra, Sam Mathew, Mahesh V. Panchagnula, Srikanth Vedantam, "Effect of size distribution on mixing of a polydisperse wet granular material in a belt-driven enclosure", Granular Matter, 2016, Vol. 18, 30

  5. Pramode K Das, Sam Mathew, A J Shaiju and B S V Patnaik, "Energetically efficient proportional-integral-differential (PID) control of wake vortices behind a circular cylinder", Fluid Dynamics Research, 2015, Vol. 48, 015510

  6. Sam Mathew, B S V Patnaik and T John Tharakan, "Numerical study of air-core vortex dynamics during liquid draining from cylindrical tanks", Fluid Dynamics Research, 2014, Vol. 46, 025505

  7. Sam Mathew, Ganesh Visavale and Vijay Mali, "CFD Analysis of a Heat Collector Element in a Solar Parabolic Trough Collector", International Conference on Applications of Renewable and Sustainable Energy for Industry and Society, Hyderabad (REIS-2010), 2010

  8. Sam Mathew, Ganesh Visavale and Vijay Mali, "Making order in the cabinet : Integrating CFD in the green energy design process for food industry helps identify and fix causes for uneven drying in a Solar Cabinet Dryer", Ansys Users Conference, Bangalore, 2010

  9. Raja Gopal Rayavarapu, Wilma Petersen, Constantin Ungureanu, Janine N. Post, Ton G. van Leeuwen, and Srirang Manohar, "Synthesis and Bioconjugation of Gold Nanoparticles as Potential Molecular Probes for Light-Based Imaging Techniques", Int. J. of Biomedical Imaging, 2007, 2007:29817

Popular repositories Loading

  1. helmet-detection helmet-detection Public

    Detect helmets and person at construction sites

    Jupyter Notebook 1 1

  2. test-repo test-repo Public

  3. datasciencecoursera datasciencecoursera Public

    The Data Scientistโ€™s Toolbox account

  4. datasharing datasharing Public

    Forked from jtleek/datasharing

    The Leek group guide to data sharing

  5. thrust thrust Public

    Forked from NVIDIA/thrust

    Thrust is a parallel algorithms library which resembles the C++ Standard Template Library (STL).

    C++

  6. Wet_Granular_SPP Wet_Granular_SPP Public

    Combined code for wet granular and self-propelled particles

    C++