Error Curves
Time Limit: 4000/2000 MS (Java/Others) Memory Limit: 65536/65536 K (Java/Others)Total Submission(s): 860 Accepted Submission(s): 322
Problem Description
Josephina is a clever girl and addicted to Machine Learning recently. She
pays much attention to a method called Linear Discriminant Analysis, which
has many interesting properties.
In order to test the algorithm's efficiency, she collects many datasets.
What's more, each data is divided into two parts: training data and test
data. She gets the parameters of the model on training data and test the
model on test data. To her surprise, she finds each dataset's test error curve is just a parabolic curve. A parabolic curve corresponds to a quadratic function. In mathematics, a quadratic function is a polynomial function of the form f(x) = ax2 + bx + c. The quadratic will degrade to linear function if a = 0.
It's very easy to calculate the minimal error if there is only one test error curve. However, there are several datasets, which means Josephina will obtain many parabolic curves. Josephina wants to get the tuned parameters that make the best performance on all datasets. So she should take all error curves into account, i.e., she has to deal with many quadric functions and make a new error definition to represent the total error. Now, she focuses on the following new function's minimum which related to multiple quadric functions. The new function F(x) is defined as follows: F(x) = max(Si(x)), i = 1...n. The domain of x is [0, 1000]. Si(x) is a quadric function. Josephina wonders the minimum of F(x). Unfortunately, it's too hard for her to solve this problem. As a super programmer, can you help her?
pays much attention to a method called Linear Discriminant Analysis, which
has many interesting properties.
In order to test the algorithm's efficiency, she collects many datasets.
What's more, each data is divided into two parts: training data and test
data. She gets the parameters of the model on training data and test the
model on test data. To her surprise, she finds each dataset's test error curve is just a parabolic curve. A parabolic curve corresponds to a quadratic function. In mathematics, a quadratic function is a polynomial function of the form f(x) = ax2 + bx + c. The quadratic will degrade to linear function if a = 0.
It's very easy to calculate the minimal error if there is only one test error curve. However, there are several datasets, which means Josephina will obtain many parabolic curves. Josephina wants to get the tuned parameters that make the best performance on all datasets. So she should take all error curves into account, i.e., she has to deal with many quadric functions and make a new error definition to represent the total error. Now, she focuses on the following new function's minimum which related to multiple quadric functions. The new function F(x) is defined as follows: F(x) = max(Si(x)), i = 1...n. The domain of x is [0, 1000]. Si(x) is a quadric function. Josephina wonders the minimum of F(x). Unfortunately, it's too hard for her to solve this problem. As a super programmer, can you help her?
Input
The input contains multiple test cases. The first line is the number of cases T (T < 100). Each case begins with a number n (n ≤ 10000). Following n lines, each line contains three integers a (0 ≤ a ≤ 100), b (|b| ≤ 5000), c (|c| ≤ 5000), which mean the corresponding coefficients of a quadratic function.
Output
For each test case, output the answer in a line. Round to 4 digits after the decimal point.
Sample Input
2 1 2 0 0 2 2 0 0 2 -4 2
Sample Output
0.0000 0.5000
Author
LIN, Yue
Source
凹函数,用三分法做,精度卡的比较严。
代码如下:
#include<iostream>
#include<cstdlib>
#include<cstring>
#include<cstdio>
#include<cmath>
#include<ctime>
using namespace std;
struct poi
{
double a, b, c;
} xi[10002];
const double INF = 0.000000001;//注意精度
double cal(double x, int n)
{
double max = -0x7FFFFFFF;
for(int i = 0; i < n; i++)
{
double ans = xi[i].a * x * x + xi[i].b * x + xi[i].c;
if(ans > max)
max = ans;
}
return max;
}
int main()
{
#ifdef test
freopen("in.txt", "r", stdin);
#endif
int t, n;
scanf("%d", &t);
while(t--)
{
double left = 0, right = 1000;
scanf("%d", &n);
for(int i = 0; i < n; i++)
scanf("%lf%lf%lf", &xi[i].a, &xi[i].b, &xi[i].c);
while(left + INF < right)
{
double mid = (left + right) / 2;
double midmid = (mid + right) / 2;
double mid_v = cal(mid, n);
double midmid_v = cal(midmid, n);
if(mid_v < midmid_v)
right = midmid;
else
left = mid;
}
printf("%.4lf\n",cal(right, n));
}
return 0;
}

本文探讨了在多个数据集上使用线性判别分析时遇到的多项式误差曲线问题。通过对一系列抛物线错误曲线进行分析,目标是最小化所有数据集上的总体错误率。文章提出了一种通过三分法来寻找最小误差的方法,并提供了一个实现该算法的具体代码示例。
375

被折叠的 条评论
为什么被折叠?



