Skip to main content

🎉 We released a Spotlight OSS Version! ⭐ Star it on Github

8 posts tagged with "data curation"

View All Tags

· 7 min read
We have just released the open version of our data curation software Renumics Spotlight. It is intended for cross-functional teams who want to be in control of their data and data curation processes. In this post I would like to share our ideas behind this product.

· 8 min read
Data collection for condition monitoring has several pitfalls, potentially leading to data that is not suitable for training robust machine learning models. The data problems resulting from the data collection include but are not limited to the presence of failures in the recording equipment, the dominance of specific operating conditions, or mislabeled audio samples. In this article, we will thus help you to ask the right questions and equip you with a checklist you can use when collecting and preparing data for your condition monitoring use case.

· 11 min read
Data collection for condition monitoring has several pitfalls, potentially leading to data that is not suitable for training robust machine learning models. The data problems resulting from the data collection include but are not limited to the presence of failures in the recording equipment, the dominance of specific operating conditions, or mislabeled audio samples. In this article, we will thus help you to ask the right questions and equip you with a checklist you can use when collecting and preparing data for your condition monitoring use case.

· 6 min read
Selecting samples for training robust surrogate models in simulation can be a real challenge. Active learning-like approaches where samples are selected iteratively can help overcome this challenge. We show how to apply such a procedure to save time and computational resources while making your surrogate model even more robust.

· 10 min read
Building robust models for Visual Inspection in production settings can be a real challenge. Here, cloud services like Amazon Lookout for Vision promise relief for model training but have limitations regarding data curation. This article explores those potential shortcomings and shows how to improve over them to leverage these services to the fullest.

· 3 min read
Making your data match the real-world data of your use-case is crucial for training a robust machine learning model. This post shows you how to interactively curate your data to adapt your data in an informed manner.