K Poornima - Profile

WEB SCRAPING AND PREDICTIVE MODELING FOR BRITISH AIRWAYS: HARNESSING DATA TO IMPROVE BUSINESS DECISIONS

Industry: Aviation
GitHub URL: https://github.com/PoornimaKanasen/British_Airways_Virtual_Internship

To learn how to scrape customer review data and build predictive models.

The following steps were carried out throughout the data preprocessing and cleaning phase:

Python, BeautifulSoup, Pandas, Numpy, Matplotlib, Seaborn, Sklearn

• From our sentiment analysis with TextBlob, we have segregated the intensity of the comment into three categories which are negative, neutral and positive.
• The wordclouds are an easier way to digest large amount of data. Most frequently used words are displayed in a larger font size and the less frequent words are displayed in smaller font sizes. Both the positive and negative wordclouds have some similarities. Positive Wordcloud shows words like flight, seat and legroom while the negative shows words like flight, seat, poor, bad and so on.