Name: Nishi Paul

Job Role: Data Scientist II

Experience: 2 Years 3 Months

Address: Bengaluru, India

Skills

Python 95%
Machine Learning 90%
Deep Learning 70%
NLP 70%
LLM and Transformer 55%
RAG and Langchain 45%
Cloud - GCP 50%
SQL 85%
Data Visualization and BI Tools 90%
Statistical Analysis 85%

About

About Me

I am a Data Scientist II at HPE specializing in data analysis, machine learning, NLP, and AI-driven solutions. My expertise includes Python, SQL, Visualization, Machine Learning, Deep Learning, and Gen AI (LLM). I focus on developing LLM-based solutions, automating workflows, detecting anomalies, reducing man-hours, and enhancing efficiency through data-driven insights as a part of my job. I have successfully built and fine-tuned models like FlanT5 and BERT for sentiment analysis and summarization, reducing external dependencies. I have developed question answering model using RAG solutions, and built using Azure OpenAI API and Langchain. My work in text classification has significantly improved automation, reducing manual efforts and increasing accuracy. I also manage data pipelines and oversee reporting systems, ensuring seamless data integration and timely business insights.
Previously, at HSBC and MiQ, I worked on data extraction, ETL, and business intelligence reporting. At HSBC, I optimized Python scripts, leading to a 40% performance boost and significant time savings, while also leading ML model development for anomaly detection and data quality improvements. At MiQ, I worked with programmatic advertising platforms like TradeDesk and Google Data Studio, creating reports and data visualizations that enhanced marketing strategies and increased lead conversions. My ability to extract meaningful insights from vast datasets has consistently driven efficiency and impact across projects.
I am highly team-oriented and believe in collaboration, punctuality, and timely delivery. My strong problem-solving skills, accuracy, and innovation have earned me rewards from every company I've worked in. I am also a dedicated freelancer and tutor, sharing my knowledge with aspiring data scientists. With a natural inclination for continuous learning, I stay updated with the latest advancements in AI and data science, always striving to develop impactful and efficient solutions.

  • Profile: Data Scientist II
  • Domain: Technology & Compute and Quality Checking of Servers
  • Education: Masters in Computer Applications
  • Language: English, Bengali, Hindi
  • Interest: Traveling, Book Reading, Painting, Freelancing, Teaching

0 +   Projects completed

LinkedIn

Experience


2023 December - Present

Data Scientist II

Hewlett Packard Enterprise (HPE)

HPE provides edge-to-cloud solutions, including computing, storage, networking, AI-driven services, and a range of enterprise servers across multiple generations.

    At HPE, I designed and deployed real-time classification models including LightGBM, Random Forest, and XGBoost to categorize incoming customer cases, which reduced manual triage time by approximately 45% and improved prioritization accuracy to around 92%. This involved extensive feature engineering, including dimensionality reduction through PCA on both textual and categorical metadata. I also utilized Impala SQL and Dataiku DSS to preprocess over 10,000 customer case logs weekly and applied NLP models such as Word2Vec and GloVe for contextual text embedding and pattern recognition in customer feedback. To uncover latent trends in unlabeled server issue data, I conducted unsupervised learning using clustering algorithms such as KMeans, DBSCAN, and Hierarchical Clustering. Analyzing over 2.5 million server log entries, I identified six key failure clusters that enabled the development of automated alerts and contributed to a 28% reduction in unresolved server anomalies. I also built LLM-based tools using Anaconda on NVIDIA L40 GPUs to assist support teams in product inquiries and case resolutions. Using LangChain with Azure OpenAI API, I implemented retrieval-augmented generation (RAG) pipelines for question answering, while fine-tuning models like Flan-T5 for sentiment analysis and Gemma, Granite, and Llama for summarization and context retrieval, improving internal resolution rates by 73% across four departments. Additionally, I designed rule-based and statistical models to detect early indicators of customer issues, applying the Pareto principle to optimize prioritization and reducing detection time by 34%. I managed data pipelines integrating inputs from over six sources using Dataiku, ensuring data consistency and version control. To support data-driven decision-making, I developed and maintained three interactive Power BI dashboards tracking case inflow, model health, and team productivity. As the point of contact for four cross-functional teams, I led several generative AI initiatives covering summarization, sentiment analysis, question answering, and context extraction to enhance support operations and reduce external tool dependency.

2022 July - 2023 November

Data Analyst

HSBC

HSBC (Hongkong and Shanghai Banking Corporation) is a global financial services company offering banking, wealth management, and investment solutions across multiple markets.

    At HSBC, I worked as a Data Analyst focused on trade and transaction data, where I extracted and processed over 20 million records daily through secure API pipelines into Data360 ETL workflows. I performed extensive EDA using Python and SQL, applying over 70 internal validation rules to enhance data quality and ensure regulatory compliance—improving accuracy metrics by 35%. I built a regression-based anomaly detection model that reduced manual exception handling by 2–3 hours/day and increased risk flagging efficiency by ~50%. I also developed and deployed classification models (Random Forest, XGBoost, Logistic Regression) using PCA-transformed trade metadata, achieving 90%+ accuracy in transaction categorization. Leading a team of 3 analysts, we automated critical monitoring workflows, improving system reliability and reducing script failures by 40%. Additionally, I created 5 MySQL summary tables for downstream QlikSense dashboards used by 3 key business teams and optimized modular Python ETL scripts, cutting processing time by ~40%. My role involved close collaboration with stakeholders to deliver actionable insights and accelerate data-driven decision-making.

2022 January - 2022 June

Data Analyst Intern

MIQ Digital

MIQ Digital is a programmatic media company that provides data-driven advertising and analytics solutions for targeted marketing.

    As a Data Analyst, I worked on programmatic advertising data for four key clients, leveraging Python-based data analysis techniques to generate weekly and monthly performance reports. This work led to a 15–20% improvement in campaign efficiency. I extracted and stored large-scale advertising data using DSP platforms and Amazon S3, and conducted in-depth exploratory analysis on Databricks using PySpark and SQL, handling over 10 million records per week. To streamline operations, I automated data pipelines with reusable scripts, reducing reporting time by 30–40%. I created actionable visual insights using Power BI and Python libraries like Seaborn and Matplotlib, highlighting key trends in CTR, CPA, spend, and audience behavior. Additionally, I curated clean datasets for internal teams on S3 to support faster decision-making and performed variance and anomaly analysis that directly contributed to data-driven business optimizations and improved ROI.



Education


2019-2022

Masters in computer applications

University of Engineering and Management, Kolkata

Course Duration: Three Years.

Grade: First class distinction.

2016-2019

Bachelors in Physics Honors

University of Calcutta, Kolkata

Course Duration: Three Years.

Grade: First class.

Certifications

Certifications and Badges

Here are the concepts and skills I have learned.

Tensorflow Developer Certification by Coursera

I learned how to build and train neural networks using TensorFlow, how to improve network performance using convolutions as you train it to identify real-world images, how to teach machines to understand, analyze, and respond to human speech with natural language processing systems, and more!

NLP with Classification and Vector Spaces

The course covers fundamental NLP techniques, including text classification, sentiment analysis, and vector space models for word embeddings. It taught me how to represent and process text data using machine learning and deep learning models.

Machine Learning Specialist

I gained hands-on experience with various techniques used in Machine Learning, starting from Data Exploration to Exploratory Data Analysis to Feature Engineering and then to Model building, which is followed by Model evaluations and Model Hyperparameter tuning to finally fixing on required Model. I got an in-depth understanding of multiple ML methods including Dimentionality Reduction. This is a part of HPE Certification Program, which I achieved in my tenure at HPE.

Projects

Projects

Below are the sample Data Science and LLM projects.

Sentiment Analysis of User Reviews Using LLM

Performed Sentiment Analysis by finetuning models like BERT and DistilBERT using Reviews Dataset. The model is partially fine fine-tuned with positive, negative, and neutral labels only. It can be leveraged more!

Rude or Strict Chat Model that will tell you to Work!

Hosted on Hugging Face with Streamlit, I built a rude chatbot using the Gemini Model and API. No matter what you ask, it will always tell you to work!
You will get to query twice only due to limited token calls Try it out HERE

Scrape the text Content of a website

This is to scrape textual data from any website. Click Here to View the Hosted Project on Streamlit
You want to have the content of the blog? Copy and paste the URL and you will get the content.


Details of Multiple Companies at one place

Get company names, company website, and a brief analysis of company growth at one place! With more than 80,000 company names listed across different industries, your company search will be easier. CLICK HERE to see how it may help you

Strategic Marketing Campaigns with ML

This project simplifies profitable marketing by analyzing customer behavior before investment. I conducted thorough data analysis and applied multiple classification ML models to predict whether a customer will engage with a campaign, ensuring smarter and more effective marketing decisions.



More projects on Github

I love to solve business problems and work in multiple domains

Connect with me to discuss and see how I can help you out with your tasks.

GitHub

Activities

Activities

Image

Data Science Tutor (Part Time and Freelance)

I have a passion for sharing knowledge and have tutored numerous students both online and offline. Collaborating with various institutions, I have conducted online classes on Python, Data Analysis, Machine Learning, Deep Learning, NLP, LLM, and Transformers (Basic level), helping students build a strong foundation in these areas. Additionally, I have delivered several offline talks on AI and ML in various technical groups, engaging with aspiring professionals and enthusiasts to discuss industry trends and best practices.

I firmly believe that teaching is a two-way learning process, and sharing knowledge strengthens my own understanding while exposing me to new perspectives. This continuous exchange of ideas fuels my passion for learning and growth, encouraging me to stay updated with advancements in the field. By mentoring students and guiding professionals, I contribute to building a stronger AI and ML community while further refining my expertise in these ever-evolving domains.

Image

ML/DL and Data Analysis Freelancer

As a freelancer, I have worked on machine learning, deep learning, and data analysis projects, gaining experience across various domains. Over the past two years, I have successfully completed more than 20 projects, delivering high-quality solutions tailored to client needs. Whether building predictive models, performing exploratory data analysis, or training machine learning models, I ensure that every project is executed with precision. My commitment to accuracy, timely delivery, and problem-solving has consistently resulted in satisfied clients who appreciate the reliability and effectiveness of my work.

Freelancing has been a valuable avenue for expanding my expertise across diverse datasets, models, and project scopes. Each project presents unique challenges, allowing me to gain in-depth insights into different data structures, feature engineering techniques, and model optimization strategies. This exposure has not only strengthened my technical knowledge but has also directly benefited my work in an office setting, where I apply these learnings to lead complex tasks with confidence.

Image

Blog Writer

I occasionally write blogs, sharing insights and experiences from my journey in Data Science and Data Analysis. While I try to maintain a writing routine, my strict schedule sometimes makes it difficult to stay consistent. However, whenever I get the time, I make sure to put my thoughts into words. Most of my writing is published on Medium, where I discuss various topics, from machine learning techniques to data visualization best practices. Additionally, I have collaborated with a few institutions, creating informative content tailored to help learners and professionals understand complex concepts in a structured and simplified way.

Click HERE to visit my Blog Page I believe that writing is one of the best ways to reinforce learning, as it helps me retain knowledge and refine my understanding. By documenting my thoughts, I not only solidify what I have learned but also create a resource that can benefit others in the field.

0 Achievements
0 Projects
0 Mentored Students
0 Written Blogs

Contact

Contact Me

Below are the details to reach out to me!

Address

Home Town : Kolkata, West Bengal, India

Work Location : Bengaluru, Karnataka, India

Connect with me via email or LinkedIn for official discussions

LinkedIn Profile

Email Address

nishi.paul.in@gmail.com

Download Resume

Resume



Have a Question? Click Here

If you are seeking assistance with a project, looking for a tutor for your institution or platform, or considering hiring me for a part-time, full-time, or contract position, please feel free to connect with me. You can contact me directly through my social channels or submit your query via the provided form.