Wednesday, April 18, 20183:00 pm – 4:00 pm
129 Hesburgh Library, Center for Digital Scholarship
Workshop Overview
Python is a free, widely-used, open source programming language for general purpose programming. It lets you work quickly and integrate systems more effectively.
Pandas is a free software library, written for Python programming language, for data manipulation and analysis. It offers data structures and operations for manipulating numerical tables and time series. Pandas is well suited for many different kinds of data, such as:
- Tabular data with heterogeneously-typed columns, as in an SQL table or Excel spreadsheet
- Ordered and unordered (not necessarily fixed-frequency) time series data
- Arbitrary matrix data (homogeneously typed or heterogeneous) with row and column labels
- Any other form of observational / statistical data sets. The data doesn’t need to be labeled to be placed into a pandas data structure
This introductory workshop will demonstrate some of the capabilities of Python for basic data manipulation and analysis, with an emphasis on pandas.
No prior knowledge is assumed.
Instructor
James Ng, Economics and Social Science Data Librarian
Jng2@nd.edu
