fun where(booleans: BooleanArray, ifTrue: Any, ifFalse: Any): DataCol
A numpy equivalent to
df['color'] = np.where(df['Set']=='Z', 'green', 'red')
See https://stackoverflow.com/questions/19913659/pandas-conditional-creation-of-a-series-dataframe-column
In R the corresoponding pattern would be mutate(df, foo=if_else())
flightsData.addColumn("delay_category") { df ->
where(df["dep_delay"].toDoubles().mapNonNull { Math.abs(it) > 3 }.nullAsFalse(), "red", "green")
}