D

Project Templates

It is very easy and convenient to put all the data, code, and outputs in the same folder. However, this convenience is negated by disadvantages of having a messy project folder. That is, putting everything into a single folder can easily lead to a folder on your computer with tens or hundreds of files, which can become unmanageable and confusing for not only others, but yourself.

At minimum, I suggest the following folder structure for any analysis project:

my_project/
  |
  |- data/
  |
  |- analysis/
  |
  +- output/

I put all my data sets in the data folder, any code I write for analysis in the analysis folder (sometimes I will name this code or src), and finally cleaned data sets or other outputs such as figures in the output folder. ...

Get Pandas for Everyone: Python Data Analysis, 2nd Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.