Model Development
Data Science, Data Management Paul Karsten Data Science, Data Management Paul Karsten

Model Development

This blog post outlines the second phase of our Data Science Process: Model Development. Which involves building, training, and evaluating models based on data gathered during Question Formation. The process is iterative, experimenting with different algorithms, features, and parameters in a sandbox environment before scaling to larger datasets. Model performance is evaluated using metrics, validation for overfitting/underfitting, and checks for robustness and interpretability. Finally, models must be versioned, monitored for data drift, and continuously updated to ensure they remain effective and relevant over time.

Read More
Question Formation and Data Analysis in Data Science
Data Science, Data Management Paul Karsten Data Science, Data Management Paul Karsten

Question Formation and Data Analysis in Data Science

This blog post focuses on the first phase of our Data Science Process: Question Formation and Data Analysis. In this phase, we iterate multiple times through question formation, data collection, and exploration. Initial questions are likely to be of low fidelity. Through the process of data exploration, the questions gain fidelity and drive toward business value.

Read More