Chandana Arutla

Data Analyst | ML Engineer

View the Project on GitHub Chandu-2122/portfolio

Portfolio

About Me

Data Analysis enthusiast with a strong academic background in Mathematics and Computer Science, experienced in Data Analysis, Machine Learning, and proficient in Advanced Excel, Python, SQL and Power BI. Desperately waiting to gain hands-on experience to combine tireless hunger for new skills with desire to exploit cutting-edge data science technology.

Education

Parul University, Gujarat [2020 – 2024]

Under Graduation: BTech, Computer Science Engineering with Specialization in Artificial Intelligence

Certifications

Technical Skills

Projects

- Streamlit Breast Cancer Predictor App

Objective: Build an app that predicts cancer by showing the visualization based on the given cell nuclei measurements.

Dataset Description:

Attribute Information:

The mean, standard error, and “worst” or largest (mean of the three largest values) of these features were computed for each image, resulting in 30 features. For instance, field 3 is Mean Radius, field 13 is Radius SE, field 23 is Worst Radius.

Project Structure:

Libraries Used:

Data Preparation:

Visualization:

App Snippet: streamlit_breast_cancer_predictor

Results:

Conclusion:


- Streamlit Exploratory Data Analysis App

Objective: Create a one-line Exploratory Data Analysis (EDA) experience.

Working: This app analyzes the uploaded CSV files, providing in-depth insights into the dataset’s characteristics through exploratory data analysis techniques.

Features:

Libraries Used:

App Snippet:

streamlit_eda

Result: Based on the uploaded CSV file or the example dataset, the EDA report is generated with the help of ydata-profiling and Streamlit on this app.

Conclusion: Streamlit framework made easy to build web application for machine learning by simplifying the creation of interactive and data-driven apps.


- Exploratory Data Analysis (EDA) on Rural Telangana Illiteracy Rates

Objective: Analyse the number of illiterates in Telangana during the year 2014 and compare it with the current year(2023).

Data Source: The dataset is taken from Open Data Telangana

Dataset Description: The dataset provides information about the number of illiterates in the rural areas of Telangana State by gender to gram panchayat level. This data is according to the old districts during the perioid 2014.

Dashboard Snippet:

Dashboard

Insights Gained:

Result: Conducted in-depth analysis of the illiteracy rates in rural Telangana using data sourced from Open Data Telangana and analyzed the number of illiterates in 2014 and compared these statistics with the current year (2023) to gauge changes and trends in literacy rates over the years.

Conclusion: Even after 9 years, the district that was once known as Mahabubnagar has the highest percentage of illiterates in Telangana.


- Web Scraping and Data Extraction from Amazon.in for Electronic Gadgets

Objective: Collect comprehensive data on electronic gadgets commonly used by software employees or students, such as laptops, tablets, smartphones, smartwatches, headphones, earphones, and earbuds, during the Diwali season to especially emphasize the offer deals.

Data Source: The data is scrapped from ‘amazon.in’ website.

Dataset Description: Our required data from the webpage:

Data Preperation: Required data was extracted from the webpage by finding the mentioned tags and if no such tag was found then that value is replaced with an empty string. Products having no title value were removed from the dataset and then saved as a csv file.

Libraries Used:

Data Snippet

Data

Result: Was successfully able to web scrape the amazon.in data once before the header i used got blocked or restricted.

Conclusion: Got to know that the success of web scrapping depends on various factors: