Extracting Pre-Defined Themes By Processing Data Using Classification Models


Contributed By: SAURABH SETHI

BACKGROUND: I’m Saurabh Sethi. I have 11+ years of experience in a variety of fields. Being an early bloomer, I began exploring opportunities in call centers, where I learned to maintain efficient communication, convert sales, and build relationships. With this experience in team building and communication, I was hired by Genpact to support payment collection for a healthcare provider, where I was exposed to many different metrics and developed my interest in data. Eventually, I made a few internal moves, had a chance to experiment with provider and billing data, and took part in a Master Data Management project. With high ambition, I continued my career by joining ATCS Inc. to pilot social media analytics and listening for global brands, which drew me to the heart of analytics, and now I’m awaiting my Data Science PG diploma.

PROBLEM STATEMENT: In the process of offering social media listening and digital strategies, we use publicly available social media post feeds, collected by mention, to unearth the hidden insights that can drive strategies for our partner brands. This leaves us with highly dispersed, qualitative data, which demands a significant amount of manual effort: slicing and dicing through thousands of contextual data points to uncover the themes and patterns on which inferences are built.

GOAL STATEMENT: Create a supervised classification model, trained on a given topic, that extracts pre-defined themes by processing millions of data rows while accounting for social media usage and sarcasm.

TECHNIQUES USED: Using historical data on specialised themes, we built a supervised classification model with a regression-based Support Vector Machine technique on contextual data that was cleaned and tokenized via Natural Language Processing, and deployed it on a React Native application.
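To make the clean, tokenize, and classify steps concrete, here is a minimal sketch in plain Python. It is not the production model: it assumes a simple regex tokenizer, a bag-of-words representation, and a linear SVM trained one-vs-rest by subgradient descent on the hinge loss. The function names, the hyperparameters, the sample posts, and the three themes are all made up for illustration; a real project would more likely rely on an established machine learning library.

```python
import re

def tokenize(post):
    """Lowercase a post, drop URLs and @mentions, and split it into word tokens."""
    post = re.sub(r"https?://\S+|@\w+", " ", post.lower())
    return re.findall(r"[a-z']+", post)

def train_binary_svm(samples, targets, epochs=30, lr=0.1, lam=0.01):
    """Fit one linear SVM (hinge loss + L2 penalty) with subgradient descent.

    samples: list of token lists; targets: +1 or -1 per sample.
    Hyperparameter values are illustrative assumptions.
    """
    weights, bias = {}, 0.0
    for _ in range(epochs):
        for tokens, y in zip(samples, targets):
            score = sum(weights.get(t, 0.0) for t in tokens) + bias
            # L2 regularisation: shrink every weight a little on each step.
            for t in weights:
                weights[t] *= 1.0 - lr * lam
            # Hinge loss: only samples inside the margin move the weights.
            if y * score < 1.0:
                for t in tokens:
                    weights[t] = weights.get(t, 0.0) + lr * y
                bias += lr * y
    return weights, bias

def train_themes(labelled_posts):
    """Train one binary SVM per theme (one-vs-rest)."""
    samples = [tokenize(text) for text, _ in labelled_posts]
    themes = sorted({theme for _, theme in labelled_posts})
    return {
        theme: train_binary_svm(
            samples, [1 if t == theme else -1 for _, t in labelled_posts]
        )
        for theme in themes
    }

def predict_theme(models, text):
    """Return the theme whose SVM gives the post the highest score."""
    tokens = tokenize(text)

    def score(theme):
        weights, bias = models[theme]
        return sum(weights.get(t, 0.0) for t in tokens) + bias

    return max(models, key=score)

# Hypothetical labelled history: (post text, pre-defined theme).
history = [
    ("Order arrived two weeks late @BrandX", "delivery"),
    ("Courier lost my parcel, shipping delayed again https://t.co/x", "delivery"),
    ("Way too expensive for such a small bottle", "pricing"),
    ("Price went up, not worth the money", "pricing"),
    ("Love this new flavour, tastes amazing!", "taste"),
    ("Flavour is bland and overly sweet", "taste"),
]

models = train_themes(history)
print(predict_theme(models, "Parcel still delayed, courier lost it"))
```

The one-vs-rest layout keeps each theme's model independent, so a new pre-defined theme can be added by training one more binary model on the same tokenized history.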

OBSERVATIONS: Using the techniques learned in the training, we found a number of anomalies and redundancies in the data, caused by a few dominating discussions from influential social media accounts. This opened up another use case around author segmentation and mapping.
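One simple way to surface this kind of dominance is to measure each author's share of the feed. The sketch below assumes each row is an (author, text) pair and flags authors above a chosen share of all posts. The 20% threshold, the field layout, and the account names are illustrative assumptions, not values from the project.

```python
from collections import Counter

def dominant_authors(posts, share_threshold=0.2):
    """Return {author: share of posts} for authors at or above the threshold.

    posts: iterable of (author, text) pairs. The threshold is an
    illustrative assumption and should be tuned per dataset.
    """
    counts = Counter(author for author, _ in posts)
    total = sum(counts.values())
    return {
        author: count / total
        for author, count in counts.items()
        if count / total >= share_threshold
    }

# Hypothetical feed: one influential account posts far more than the rest.
feed = [("@big_influencer", f"promo post {i}") for i in range(6)] + [
    ("@user_a", "late delivery"),
    ("@user_b", "great taste"),
    ("@user_c", "too pricey"),
    ("@user_d", "love it"),
]

flagged = dominant_authors(feed)
print(flagged)
```

Flagged authors can then be down-weighted, deduplicated, or analysed as their own segment before the theme counts are interpreted.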

SOLUTION: We successfully deployed the classification model on a frontend application, allowing users to categorize social media feeds into pre-defined labels and removing the time-consuming process of manually reading and segmenting the conversation. This enabled the digital analyst to quantify the various talking points and dig further into the data to find the main concerns and opportunity areas for the brand. The model is currently configured using SVM regression equations, which deliver an accuracy of 92% and process 1 million rows of contextual data points in about 5 minutes.

“Automation is cost-cutting by tightening the corners and not cutting them.” – Haresh Sippy
