Titanic Train Data Analysis using R Language
Introduction:
In this article, we will perform a detailed analysis of the Titanic train dataset using the R programming language. The dataset contains information about the passengers aboard the Titanic, including their survival status, passenger class, age, gender, and more. We will explore the dataset, perform data cleaning and preprocessing, conduct exploratory data analysis, and build predictive models to determine the factors influencing survival on the Titanic.
Dataset Overview:
The Titanic train dataset consists of the following variables:
- PassengerId: Unique identifier for each passenger
- Survived: Survival status (0 = No, 1 = Yes)
- Pclass: Passenger class (1 = 1st, 2 = 2nd, 3 =