https://pandas.pydata.org/docs/getting_started/intro_tutorials/10_text_data.html How to manipulate textual data — pandas 2.2.0 documentation This tutorial uses the Titanic data set, stored as CSV. The data consists of the following data columns: PassengerId: Id of every passenger. Survived: Indication whether passenger survived. 0 for yes and 1 for no. Pclass: One out of the 3 ticket classes: C ..