How to know which features contribute the most to the outlier score after applying GMM detector?

Ask Question

Asked 1 year, 2 months ago

Modified 1 year, 2 months ago

Viewed 41 times

I have a dataset with 100+ features, upon which I test GMM to detect anomalies. For example, I add some Gaussian noise to 5-6 features of 100 points. GMM detects the points easily, but the next suggested step is to develop an algorithm to locate the features with noise. This is where I got stuck.

Outlier score returned by the sklearn is calculated as a sum for all the dimensions of a datapoint. I tried to retrieve internal variables to understand the process of the Gaussian log-likelihood calculation, which underlies the outlier score and somehow segregate features which have outstanding values, but that was not successful. I suspect this has something to do with the way covariance matrices are calculated.

I would be happy to get some hints on where to look at either inside the GMM algorithm or suggestions on some post-detection analysis methods.

edited Sep 27, 2024 at 16:24

asked Sep 26, 2024 at 9:13

AlisherAliev

112 bronze badges

Add a comment |

0 Your Answer

Sign up or log in

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.

Stack Exchange Network

How to know which features contribute the most to the outlier score after applying GMM detector?

0

Your Answer

Hot Network Questions

How to know which features contribute the most to the outlier score after applying GMM detector?

0

Know someone who can answer? Share a link to this question via email, Twitter, or Facebook.

Your Answer

Sign up or log in

Post as a guest

Related

Hot Network Questions