Technique #2: Sampling
How do you load only a subset of the rows?
When you load your data, you can specify a skiprows function that will randomly decide whether to load that row or not:
```
from random import random
def sample(row_number): ... if row_number == 0: ... # Never drop the row with column names: ... return False ... # random() returns uniform numbers between 0 and 1: ... return random() > 0.001 ... sampled = pd.read_csv("/tmp/voting.csv", skiprows=sample) len(sampled) 973 ```