Intro
Who is Jayesh
I am Jayesh Jain — a Microsoft Certified Trainer, Senior Consultant, and Tech Educator with expertise across
cloud computing, data engineering, and advanced analytics. I currently work as a
Senior Consultant (Technical Lead – Applications) at BI4ALL, supporting the
Johnson & Johnson portfolio across MedTech, Pharma, and Innovative Medicine.
My work spans Azure & AWS cloud architecture, data engineering, application reliability, and
digital transformation across enterprise environments.
Alongside my industry work, I am a passionate educator with over 7 years of experience training
students and professionals in Data Science, Machine Learning, AI, and Programming.
I have trained more than 100 students across top universities and have served as
visiting faculty at leading business schools in Mumbai.
jayeshjain1529@gmail.com
Interests
Programming & Data Science
Programming is my core passion and something I strive to grow in every day.
I specialize in Machine Learning, Deep Learning, Cloud Architecture, and Data Engineering.
I am a 7× Microsoft Certified Professional and a Microsoft Certified Trainer (MCT).
I have published multiple research papers in international journals and worked on projects
across cloud, analytics, and AI ecosystems.
Ranked 5th among 200 participants at a district‑level coding marathon.
Teaching & Training
Teaching is one of my biggest passions. Over the past 7 years, I have taught programming,
data science, and AI to students and professionals through seminars, webinars, workshops,
and university‑level courses. I currently serve as:
• Visiting Faculty – N.L. Dalmia Institute of Management (3.5 years)
• Visiting Faculty – School of Data Science & Business Intelligence (4 years)
• Trainer – Indian Navy (IMS Training: AI, Data Analytics, Cybersecurity)
I have led AI/ML workshops, faculty development programs (FDPs), and curriculum design initiatives across universities.
Awarded “Most Valuable IT Trainer”.
Work Experience
Current
I am currently working as
Senior Consultant (Technical Lead – Applications)
Company: BI4ALL (Client: Johnson & Johnson)
Technologies: Azure, AWS, Databricks, ADF, Power BI, SQL, PySpark
Work Involved
- Architecting and optimizing enterprise-grade cloud solutions across Azure and AWS for MedTech, Pharma, and Innovative Medicine divisions.
- Leading Application Reliability Engineering (ARE) and platform operations for 20+ business‑critical applications ensuring scalability, high availability, and security.
- Designing and managing data engineering pipelines powering analytics, dashboards, and decision systems across multiple J&J business units.
- Collaborating with cross-functional stakeholders to align architecture, business logic, and strategic technology initiatives.
- Driving modernization, performance tuning, and cost‑optimization initiatives across cloud ecosystems.
- Implementing CI/CD pipelines using Terraform, Jenkins, and Git workflows for automated deployments.
- Leading troubleshooting, impact assessments, and solution design for large-scale application enhancements and integrations.
Previous Role
Senior Data Engineer at LTIMindtree
Technologies: Azure, Databricks, Power BI, SQL
Work Involved
- Designed and implemented Azure infrastructure for enterprise applications.
- Developed scalable and optimized ADF, PySpark, and Scala pipelines with strong data quality validation.
- Performed analysis on third‑party data solutions ensuring compliance with internal and client requirements.
- Analyzed complex datasets to identify trends, anomalies, and operational risks.
- Achieved 40% cost reduction and 30% performance improvement on Databricks through optimization techniques.
- Integrated CI/CD automation with Jenkins and Terraform.
- Implemented data archiving & compression frameworks achieving ~40% storage savings.
Internships
Research And Teaching Assistant
@ TalentHome Solutions LLP
Worked as a Research and Training Assistant across multiple IT technologies.
Trained 100+ students, delivered workshops across engineering colleges, built interactive online/offline courses, and mentored teams for live IT projects including digital marketing solutions.
Android Developer
@ Laqshya Infosoft Solutions
Completed intensive Android Development training and built MAKE MY PC – an application that allows users to design a virtual custom PC, compare specifications, and compute final pricing.
Projects
Real Time Twitter Sentiment Analysis
Used the Twitter API to create endpoints and fetch real-time tweets from any user. The Tweepy Python library was used for authentication and extraction. The tweets were then cleaned using regular expressions (removing hashtags, @mentions, RTs, emojis, and URLs). Used pandas to convert the raw data into a data frame, and NLP and sklearn libraries to generate subjectivity and polarity scores from the text. Built a scoring system that adds a new column (sentiment) holding positive, negative, or neutral values based on the polarity score, and generated graphs and a word cloud for better data visualization.
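The cleaning and scoring steps described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the regular expressions, the function names clean_tweet and label_sentiment, and the zero threshold for polarity are all assumptions, and the polarity score itself is taken as an input rather than computed.

```python
import re

# Illustrative patterns (assumptions, not the project's actual expressions).
URL_RE = re.compile(r"https?://\S+")
RT_RE = re.compile(r"^RT\s+")
MENTION_RE = re.compile(r"@\w+:?")
HASHTAG_RE = re.compile(r"#\w+")
# Covers common emoji and symbol blocks only; real emoji coverage is wider.
EMOJI_RE = re.compile("[\U0001F300-\U0001FAFF\u2600-\u27BF]")


def clean_tweet(text: str) -> str:
    """Strip URLs, the RT marker, @mentions, hashtags, and emojis from a tweet."""
    for pattern in (URL_RE, RT_RE, MENTION_RE, HASHTAG_RE, EMOJI_RE):
        text = pattern.sub("", text)
    # Collapse the whitespace left behind by the removals.
    return re.sub(r"\s+", " ", text).strip()


def label_sentiment(polarity: float) -> str:
    """Map a polarity score to a label (hypothetical threshold at zero)."""
    if polarity > 0:
        return "positive"
    if polarity < 0:
        return "negative"
    return "neutral"


cleaned = clean_tweet("RT @user: Loving the new release 😀 #python https://t.co/abc")
```

In a pandas workflow, functions like these would typically be applied column-wise to build the cleaned text and sentiment columns.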
Covid-19 Analysis using Big Data Tools
The novel coronavirus (Covid-19) caused a pandemic that led to a worldwide shutdown, affecting all sectors. This raised alarms over one of the most overlooked areas: healthcare management. The rise in the number of cases sharply increased the demand for beds and medical staff, but predicting the demand for the following months was very difficult without data handling tools. This problem can be addressed by selecting suitable big data tools for data handling and visualization, so that the trend can be analyzed precisely and the demand met accordingly.
Customer analysis on factors leading to purchase of vehicles using R programming
For car and motorbike companies, it is difficult to apply one marketing strategy to a wide variety of customers, because customers buy based on attributes such as annual income, number of children, and marital status. Customer segmentation uses clustering algorithms to group customers based on their data, making it easier for these companies to target each group and increase sales and revenue.
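As a rough illustration of clustering-based segmentation, the sketch below runs a small k-means loop. It is written in Python rather than R, and it is not the project's implementation: the choice of k-means, the toy customer values, the two features used, and the first-k-points initialization are all assumptions made for the example.

```python
import math


def kmeans(points, k, iters=20):
    """Plain k-means: returns (cluster index per point, centroids)."""
    # Deterministic initialization for the example: use the first k points.
    centroids = [list(p) for p in points[:k]]
    assignments = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each point joins its nearest centroid.
        assignments = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return assignments, centroids


# Made-up toy data: (annual income in thousands, number of children).
customers = [(25, 0), (90, 2), (27, 1), (95, 3), (30, 0), (88, 2)]
labels, centers = kmeans(customers, k=2)
```

With real data, features on very different scales (income versus number of children) would normally be standardized first, and categorical attributes such as marital status would need encoding before distances are meaningful.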
Deep learning to get data entries from paperback forms
Digitization is still impeded, especially in rural areas, by a lack of basic knowledge of operating computers and using the internet. To minimize the time and resources wasted in long queues, this project uses deep learning to automate redundant data-entry tasks.
IOT based SMART HOME
Home automation refers to controlling home appliances and devices by programming them to suit our needs. This makes life easier for the people living in the home, since controlling devices and appliances by voice or gestures lowers the difficulty of performing tasks and saves time and energy. Home automation can be achieved with various degrees of intelligence and automation through a network. In this project, a microcontroller is connected to the network, enabling us to program all the home appliances and devices as needed. Home automation provides security, energy efficiency, and ease of use, which is why it is increasingly adopted. It also provides a remote interface for controlling and monitoring home appliances from a web browser.
Social networking website for college
An online social networking service through which users can build a network among people associated with the college. Students can post queries and doubts related to their studies on the website, and teachers can respond. Students can also view their timetable, syllabus, and the workshops being organized in the college.
Research
Using CRNN to Perform OCR over Forms
International Journal of Engineering Research & Technology
Description:
This paper presents the CRNN model used to perform OCR. The input is a grayscale image of size 128x32. After convolution through 5 CNN layers, the image is transformed into a feature sequence of size 32x256. The RNN part consists of 2 LSTM layers with 256 nodes each, which propagate information through the sequence and map it to a matrix of size 32x80. Each element in the matrix is a score for the corresponding character (80 characters) at that particular time-step (32 time-steps). The CTC layer then uses this matrix, along with the ground truth text, for training. During inference, it uses the beam search decoding algorithm to predict the output. The current model gives an error rate of 10.625% on the IAM word dataset. This can be reduced by using the Best Path Decoding or Word Beam Search algorithm in CTC. Since the words typically encountered in railway forms are proper nouns, the Vanilla Beam Search algorithm is the best choice. An evaluation was done on forms filled in by people with various handwriting styles. It was observed that handwriting matching the style of the IAM dataset, i.e. with larger inter-word spacing and smaller intra-word spacing, was segmented and recognized correctly.
Publication URL: Read it here
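To make the decoding step more concrete, here is a minimal sketch of best path (greedy) CTC decoding, one of the alternatives named in the description above. It is an illustration under stated assumptions, not the paper's code: the function name best_path_decode is hypothetical, the CTC blank is assumed to be the last column of the score matrix, and the tiny 6x3 matrix stands in for the 32x80 matrix the model produces.

```python
def best_path_decode(scores, alphabet):
    """Greedy CTC decoding over a (time-steps x classes) score matrix.

    Assumes the blank label is the last class, i.e. index len(alphabet).
    """
    blank = len(alphabet)
    # Pick the highest-scoring class at every time-step.
    best = [max(range(len(row)), key=row.__getitem__) for row in scores]
    decoded = []
    prev = None
    for idx in best:
        # Collapse repeated labels, then drop blanks.
        if idx != prev and idx != blank:
            decoded.append(alphabet[idx])
        prev = idx
    return "".join(decoded)


# Toy example: 6 time-steps, classes = ['a', 'b', blank].
toy_scores = [
    [0.9, 0.05, 0.05],  # a
    [0.8, 0.1, 0.1],    # a (repeat, collapsed)
    [0.1, 0.1, 0.8],    # blank (separates the two a's)
    [0.7, 0.2, 0.1],    # a
    [0.1, 0.8, 0.1],    # b
    [0.2, 0.7, 0.1],    # b (repeat, collapsed)
]
text = best_path_decode(toy_scores, "ab")
```

Best path decoding only follows the single most likely label per time-step, whereas beam search keeps several candidate labelings and sums the scores of paths that collapse to the same text.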
Automatic data collection from Forms using OCR
International Research Journal of Engineering and Technology
Description:
Optical Character Recognition (OCR) seems a very viable option for this case. However, blindly carrying out OCR won't produce good results, as a form contains many elements other than the user's written data. This paper presents a structured way of scanning the paper form, collecting only the necessary content, i.e. the user input data, and updating it to the database accordingly. The process uses the scale-space technique to segment the words from the input field and feeds them as input images to a CRNN (Convolutional Recurrent Neural Network) model, which performs OCR on those words and provides output ready to be written to the database. This automated system cuts down waiting time and increases efficiency compared to the system currently in use.
Publication URL: Read it here
Certifications
Microsoft Certified
Amazon Web Services
Python
Stocks and Finance