2020 is almost over, and we’re already fully looking forward to 2021. Even so, we can’t help but take one last look at the year that was. At the Explorium blog, we covered quite a bit of ground, exploring some interesting areas of data science, diving into machine learning (ML) for business, and even throwing in a few of our favorite TV shows. Before we turn the page on 2020, let’s look back at our top ten data science blogs of the year that was.
1. What Happens When The Definition of Risk Changes?
What happens when the rules of the game change overnight and leave you completely exposed? 2020 was a year of surprises, and it taught us the importance of having risk models and the right data to face even an unexpected storm. Read more here.
2. Understanding and Handling Data and Concept Drift
It’s tempting to think our ML models will last forever once we deploy them, but that’s just wishful thinking. In reality, factors like data and concept drift mean that the longer you leave your models, the less effective they’ll be. Read more now to see how you can avoid model drift.
3. How Dynamic Risk Modeling Gives Your Detection a Boost
Rules-based risk models are okay in some situations, but they really miss out when it comes to spotting nuances and slight shifts in established patterns. This post is all about dynamic risk models and why they should be a frontline of defense for organizations that deal with risk. Learn more here.
4. Could ML-Powered Risk Models Really Spot Ozark’s Schemes?
We love to root for the bad guy in TV shows, and Ozark’s Marty Byrde is as sly as criminals come. His money laundering schemes work for the Mexican cartels and avoid the law through three seasons (and counting), but could Marty evade ML-powered risk models? Read more to find out.
5. Could Walter White and Marty Byrde Fool The Best Risk Models?
How smart are Marty Byrde and Walter White, really? This article looks at both Ozark and Breaking Bad’s money-laundering operations and pits them against the best risk models to see if Walter White and Marty Byrde are as smart as they claim they are. See how they stack up here.
6. Turning Data From a Cost Center Into An Asset
Data acquisition is a must in today’s landscape, but data can be costly, and organizations can sometimes drag their feet about writing the check. So, how can you change that? By reframing the problem. You shouldn’t think of the costs that come with data, but the ROI and impact it can provide. Read more now.
7. Clustering: When You Should Use It and Avoid It
If you want good results from your ML models, you need to prepare data the right way. Clustering is an easy and fast way to do some pre-training prep, but it’s not always the right choice. This post is a great primer if you’re curious about clustering and when to use it. Learn more now.
8. How Data and Analytics Leaders Can Avoid Roadblocks to Data Acquisition
What can you do if you have urgent questions but the data you have doesn’t hold answers? It’s time to find new data, but it’s not as easy as it sounds. Data acquisition is a priority for data leaders, but it’s one that can come with landmines and roadblocks. This post breaks down how you can avoid them.
9. Data Bias and What It Means for Your ML Models
Machines are supposed to be completely objective, but unfortunately for ML, that’s not always the case. Indeed, though they might not be emotionally attached to a problem, ML models may develop biases that impact your ability to make predictions. It’s key to understand how and why so you can mitigate its effects. Read more about it now.
10. Cracking the Stability of Feature Selection
Choosing the right features to train your ML models is a tough step, and getting it wrong can severely impact your results. But it’s not just about picking the best-sounding features. You need to make sure the features you choose are optimal for your question and will be stable throughout the ML process. Learn more about it here.
So long, 2020. See you next year!
It’s been a great year at the Explorium blog; with so many new posts and top data science blogs, we could have easily picked our top 20. However, it’s time to turn the page and see what 2021 has in store for us in terms of our upcoming top data science blogs. Tune in next year (in a few days, that is) to see what we have prepared for you. We promise you won’t be bored, and you’ll learn some great new things.