Filter Bar
The Filter Bar is a key component of Spotlight's UI that allows you to filter your data based on specific criteria. With the Filter Bar, you can quickly narrow down your dataset to a subset that meets your needs, making it easier to work with and analyze.
Using the Filter Bar is simple: just select the column and the predicate to filter by, and Spotlight will update the data views accordingly.
Additional you can convert any active Selection into a Filter and apply it on your dataset. You can also combine all currently applied filters and rename the new Filter.
Most of the components Spotlight provides will react to data being filtered out of the dataset, and will update accordingly.
The Similarity Map and others even have options to completely ignore all datapoints not in the filtered dataset.
Adding a Predicate Filter
Disabling and Delating a Filter
Combining Filters
To create a new filter based on the currently applied filters, click the "Combine Filters" button. This will merge all active filters into a new filter, which you can rename to suit your needs.
Filter Selected
If you want to create a new filter based on the currently selected data points, you can use the "Filter from Selection" feature. This allows you to quickly turn your selection into a new filter
Filter string column
At first glance, filtering string columns in Spotlight may appear similar to filtering other data types. However, string filtering is actually much more powerful because it allows you to use regular expressions to filter matching strings.
This allows for filtering more complex patterns than just equality or inequality of two strings.
In order to find all labels columns in the fsd50k dataset containing the label Siren
or Alarm
you can e.g. create the following filter:
labels = Siren|Alarm
Overall, the use of regular expressions in Spotlight's string filtering feature gives you greater flexibility and control over your data analysis. Whether you're filtering for simple patterns or using complex regular expressions to refine your dataset.