Coursera 机器学习第9周作业1

最新推荐文章于 2020-05-03 17:11:04 发布

独木不林

最新推荐文章于 2020-05-03 17:11:04 发布

阅读量6.4k

点赞数 3

CC 4.0 BY-SA版权

分类专栏：机器学习

本文链接：https://blog.youkuaiyun.com/liuyanlin610/article/details/51259563

机器学习专栏收录该内容

11 篇文章

订阅专栏

本文探讨了异常检测算法的应用场景，包括识别信用卡交易中的异常、发现医疗记录中的异常健康状况及制造缺陷等。同时讨论了如何调整算法参数以提高检测准确性，并提出通过增加新的特征来捕捉特定类型的异常。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

1、For which of thefollowing problems would anomaly detection be a suitable algorithm? 选2和3

Givendata from credit card transactions, classify each transaction according to typeof purchase (for example: food, transportation, clothing).

Froma large set of primary care patient records, identify individuals who mighthave unusual health conditions.

Ina computer chip fabrication plant, identify microchips that might be defective.

Froma large set of hospital patient records, predict which patients have aparticular disease (say, the flu).

2、Suppose you havetrained an anomaly detection system for fraud detection, and your system thatflags anomalies when p(x) is less than ε,and you find on the cross-validation set that it is missing many fradulenttransactions (i.e., failing to flag them as anomalies). What should you do?选2

Decrease ε

Increase ε

3、Suppose you aredeveloping an anomaly detection system to catch manufacturing defects inairplane engines. You model uses p(x)=∏nj=1p(xj;μj,σ2j).

You have two features x1 =vibration intensity, and x2 = heat generated. Both x1 and x2takeon values between 0 and 1 (and are strictly greater than 0), and for most"normal" engines you expect that x1≈x2. One of thesuspected anomalies is that a flawed engine may vibrate very intensely evenwithout generating much heat (large x1, small x2), eventhough the particular values of x1 and x2 maynot fall outside their typical ranges of values. What additional feature x3 shouldyou create to capture these types of anomalies: 选2

x3=x1+x2

x3=x1/x2

x3=1/x1

x3=1/x2

4、Which of thefollowing are true? Check all that apply.选1和4

Whenchoosing features for an anomaly detection system, it is a good idea to lookfor features that take on unusually large or small values for (mainly the)anomalous examples.

Ifyou are developing an anomaly detection system, there is no way to make use oflabeled data to improve your system.

Ifyou have a large labeled training set with many positive examples and manynegative examples, the anomaly detection algorithm will likely perform just aswell as a supervised learning algorithm such as an SVM.

Ifyou do not have any labeled data (or if all your data has label y=0),then is is still possible to learn p(x), but it may beharder to evaluate the system or choose a good value of ϵ.

5、You have a 1-Ddataset {x(1),…,x(m)} and you want to detectoutliers in the dataset. You first plot the dataset and it looks like this: