Coursera Machine Learning: Week 9, Quiz 1

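This post looks at where anomaly detection applies, including flagging unusual credit card transactions, spotting unusual health conditions in medical records, and finding manufacturing defects. It also covers how to adjust the algorithm's parameters to improve detection accuracy, and how adding a new feature can capture a specific type of anomaly.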


1. For which of the following problems would anomaly detection be a suitable algorithm? (Answer: options 2 and 3)

Given data from credit card transactions, classify each transaction according to type of purchase (for example: food, transportation, clothing).

From a large set of primary care patient records, identify individuals who might have unusual health conditions.

In a computer chip fabrication plant, identify microchips that might be defective.

From a large set of hospital patient records, predict which patients have a particular disease (say, the flu).

2. Suppose you have trained an anomaly detection system for fraud detection, and your system flags anomalies when p(x) is less than ε. You find on the cross-validation set that it is missing many fraudulent transactions (i.e., failing to flag them as anomalies). What should you do? (Answer: option 2)

Decrease ε

Increase ε
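Why increasing ε helps can be seen in a small sketch: an example is flagged when p(x) < ε, so a larger ε flags every example the smaller one did, plus more. The density values and both thresholds below are made-up illustrative numbers, not taken from the quiz, and `flag_anomalies` is a hypothetical helper name.

```python
def flag_anomalies(densities, epsilon):
    """Return the indices of examples whose density p(x) is below epsilon."""
    return [i for i, p in enumerate(densities) if p < epsilon]

# Hypothetical p(x) values for six cross-validation transactions.
p_values = [0.30, 0.12, 0.04, 0.009, 0.02, 0.25]

# A small threshold flags few transactions and may miss fraud.
flagged_small = flag_anomalies(p_values, epsilon=0.01)

# A larger threshold flags a superset of those transactions.
flagged_large = flag_anomalies(p_values, epsilon=0.05)

print(flagged_small, flagged_large)
```

The trade-off is that a larger ε may also flag more legitimate transactions, so the value is normally checked against a labeled cross-validation set.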

3. Suppose you are developing an anomaly detection system to catch manufacturing defects in airplane engines. Your model uses $p(x) = \prod_{j=1}^{n} p(x_j; \mu_j, \sigma_j^2)$.

You have two features $x_1$ = vibration intensity, and $x_2$ = heat generated. Both $x_1$ and $x_2$ take on values between 0 and 1 (and are strictly greater than 0), and for most "normal" engines you expect that $x_1 \approx x_2$. One of the suspected anomalies is that a flawed engine may vibrate very intensely even without generating much heat (large $x_1$, small $x_2$), even though the particular values of $x_1$ and $x_2$ may not fall outside their typical ranges of values. What additional feature $x_3$ should you create to capture these types of anomalies? (Answer: option 2)

$x_3 = x_1 + x_2$

$x_3 = x_1 / x_2$

$x_3 = 1 / x_1$

$x_3 = 1 / x_2$
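A rough sketch of why the ratio works: for normal engines the ratio of vibration to heat stays near 1, so an engine with large vibration and small heat gives a ratio far outside that spread even when each feature alone looks ordinary. The engine readings below are invented for illustration, and the z-score is used here only as a simple stand-in for how unlikely a value is under a fitted Gaussian.

```python
import statistics

def z_score(value, sample):
    """Distance of value from the sample mean, in population standard deviations."""
    mu = statistics.mean(sample)
    sigma = statistics.pstdev(sample)
    return (value - mu) / sigma

# Hypothetical (vibration, heat) readings for normal engines, where x1 is close to x2.
normal_engines = [(0.2, 0.22), (0.4, 0.38), (0.5, 0.52), (0.6, 0.57),
                  (0.8, 0.83), (0.3, 0.30), (0.7, 0.68)]
x1 = [v for v, _ in normal_engines]
x2 = [h for _, h in normal_engines]
x3 = [v / h for v, h in normal_engines]  # the added ratio feature

# Hypothetical flawed engine: strong vibration, little heat, both values in range.
flawed_x1, flawed_x2 = 0.8, 0.2

z1 = z_score(flawed_x1, x1)
z2 = z_score(flawed_x2, x2)
z3 = z_score(flawed_x1 / flawed_x2, x3)

print(f"z(x1)={z1:.2f}  z(x2)={z2:.2f}  z(x3)={z3:.2f}")
```

With these numbers the flawed engine sits within two standard deviations on both original features but is an extreme outlier on the ratio, which is what drives p(x) down once the ratio is included in the model.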

4. Which of the following are true? Check all that apply. (Answer: options 1 and 4)

When choosing features for an anomaly detection system, it is a good idea to look for features that take on unusually large or small values for (mainly the) anomalous examples.

If you are developing an anomaly detection system, there is no way to make use of labeled data to improve your system.

If you have a large labeled training set with many positive examples and many negative examples, the anomaly detection algorithm will likely perform just as well as a supervised learning algorithm such as an SVM.

If you do not have any labeled data (or if all your data has label y=0), then it is still possible to learn p(x), but it may be harder to evaluate the system or choose a good value of ϵ.
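One common way labeled data helps, and the reason the second statement is false, is choosing ϵ on a labeled cross-validation set. The sketch below picks the candidate threshold with the best F1 score; the densities, labels, and candidate thresholds are hypothetical values, and the helper names `f1_score` and `select_epsilon` are my own.

```python
def f1_score(densities, labels, epsilon):
    """F1 of the rule 'flag as anomaly (y=1) when p(x) < epsilon'."""
    predictions = [1 if p < epsilon else 0 for p in densities]
    tp = sum(1 for pred, y in zip(predictions, labels) if pred == 1 and y == 1)
    fp = sum(1 for pred, y in zip(predictions, labels) if pred == 1 and y == 0)
    fn = sum(1 for pred, y in zip(predictions, labels) if pred == 0 and y == 1)
    if tp == 0:
        return 0.0  # no true positives: define F1 as 0 to avoid dividing by zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def select_epsilon(densities, labels, candidates):
    """Return the candidate threshold with the highest F1 on the labeled set."""
    return max(candidates, key=lambda eps: f1_score(densities, labels, eps))

# Hypothetical cross-validation densities p(x) and labels (1 = anomaly).
cv_densities = [0.30, 0.25, 0.12, 0.04, 0.02, 0.009]
cv_labels = [0, 0, 0, 0, 1, 1]
candidates = [0.005, 0.01, 0.03, 0.05, 0.2]

best_epsilon = select_epsilon(cv_densities, cv_labels, candidates)
print(best_epsilon)
```

Without any y=1 examples this selection step has nothing to score against, which is why the fourth statement says evaluation and the choice of ϵ become harder.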

5. You have a 1-D dataset $\{x^{(1)}, \ldots, x^{(m)}\}$ and you want to detect outliers in the dataset. You first plot the dataset and it looks like this:

[Figure: plot of the 1-D dataset; the image is missing from the source]

Suppose you fit the Gaussian distribution parameters $\mu_1$ and $\sigma_1^2$ to this dataset. Which of the following values for $\mu_1$ and $\sigma_1^2$ might you get? (Answer: option 1)

$\mu_1 = -3,\ \sigma_1^2 = 4$

$\mu_1 = -6,\ \sigma_1^2 = 4$

$\mu_1 = -3,\ \sigma_1^2 = 2$

$\mu_1 = -6,\ \sigma_1^2 = 2$
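For reference, fitting these parameters is just the sample mean and the average squared deviation (the maximum-likelihood estimate, dividing by m). The sketch below uses a made-up dataset, not the plotted quiz data, and `fit_gaussian` is a hypothetical helper name.

```python
def fit_gaussian(xs):
    """Maximum-likelihood estimates (mu, sigma^2) for a 1-D Gaussian."""
    m = len(xs)
    mu = sum(xs) / m
    sigma2 = sum((x - mu) ** 2 for x in xs) / m  # divides by m, not m - 1
    return mu, sigma2

# Hypothetical 1-D sample, chosen only to illustrate the calculation.
data = [-6, -6, -4, -4, -3, -3, -2, -2, 0, 0]

mu, sigma2 = fit_gaussian(data)
print(mu, sigma2)
```

When reading the answer off a plot, μ is where the data are centered, and most points should fall within roughly two standard deviations (2σ) of it.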
