Read CSV
Default loader; handles most messy files.
import pandas as pd
df = pd.read_csv("orders.csv")Hand-picked pandas patterns from the Data Science track. The 80% you actually reach for in real analytics work, not every API.
Default loader; handles most messy files.
import pandas as pd
df = pd.read_csv("orders.csv")Same, for spreadsheets.
df = pd.read_excel("orders.xlsx", sheet_name="Q1")First thing after loading.
df.head() df.info() df.describe()
By name; list for many.
df["price"] df[["name", "price"]]
Boolean mask. Combine with & / |.
df[df["price"] > 100] df[(df["price"] > 100) & (df["status"] == "paid")]
Vectorised — never loop.
df["price_with_tax"] = df["price"] * 1.2
Row-wise transform. Slower; use when vectorised won't do.
df["initials"] = df["name"].apply(lambda n: n[0])
Cleanup before pipeline.
df = df.rename(columns={"old": "new"})
df = df.drop(columns=["unused"])Remove rows with missing values.
df.dropna(subset=["price"])
Sensible defaults.
df["score"].fillna(0, inplace=False)
Force a column to the right type.
df["price"] = df["price"].astype(float)
Convert string columns to datetime.
df["ts"] = pd.to_datetime(df["ts"])
Single metric per group.
df.groupby("country")["revenue"].sum()Multiple metrics in one shot.
df.groupby("country").agg(
revenue=("price", "sum"),
orders=("price", "count"),
)Spreadsheet-style 2D aggregation.
df.pivot_table(index="country", columns="quarter", values="revenue", aggfunc="sum")
Inner / left / outer joins.
pd.merge(orders, customers, on="customer_id", how="left")
Stack two frames vertically or side-by-side.
pd.concat([df1, df2], ignore_index=True)
Unlocks resample + rolling.
df = df.set_index("ts").sort_index()Aggregate by time bucket.
df["revenue"].resample("M").sum()Smooth + lag features.
df["revenue"].rolling(7).mean()
Default export.
df.to_csv("out.csv", index=False)Compact, typed, fast to re-read.
df.to_parquet("out.parquet")Get a feel for shape; matplotlib under the hood.
df["revenue"].plot(kind="line")
Want to actually learn these patterns, not just paste them? Open the pandas cheatsheet track — each snippet has a full lesson behind it.