Data scientist holding a Master's in Data Science from Rutgers University with expertise in Data Engineering, Machine Learning, and Statistical Analysis. Proficient in constructing robust data pipelines, optimizing advanced models, applying NLP techniques, creating chatbots, and developing impactful recommendation systems. A dedicated and experienced researcher with a sharp analytical mindset, excelling in complex project execution, and leveraging a diverse skill set that includes Python, cloud technologies, and cutting-edge data science tools.
Skills
Experience
Education
Data Analytics, Data Engineering, Data mining, Machine Learning, Deep Learning, Statistics, A/B testing, MLOps, Natural Language Processing, Computer Vision, Generative AI
Python, SQL, Pandas, Numpy, Scikit-Learn, TensorFlow, PyTorch, NLTK, OpenCV, LangChain, Streamlit, FastAPI, Matplotlib, GCP(Certified Data Engineer), BigQuery, Databricks, Collibra, ETL, Informatica Cloud (IICS)
R, Unix, MongoDB, PySpark, SageMaker, Docker, MLflow, Ariflow, Kafka, AWS, PowerBI, Tableau
Developed a chatbot with a Streamlit-based dynamic interface where users can engage in natural language dialogue to pose questions and gain insights about the input PDF file(s). Leveraged OpenAI’s embeddings to process the files into a FAISS vector store, implemented a RAG pipeline in LangChain with ChatGPT-3.5 and used prompting techniques to reduce token size
Built a classification model to predict if a customer is likely to leave the company based on certain user features. Explored different models ranging from simple logistic regression to complex tree based models like random forest and XGBoost to compare and identify best performing and most suitable model. Performed hyperparameter tuning and SMOTEEN to improve the performance by 15%
rohitmacherla125@gmail.com
© 2023 Rohit Macherla. All rights reserved