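MySQL Basics
Day 5: MySQL in Practice
I. Study Notes
1. Importing and exporting data
1.1 Export any previously created MySQL table in CSV format
Reference: https://www.yiibai.com/mysql/export-table-to-csv.html
CSV stands for comma-separated values. The CSV file format is commonly used to exchange data between applications such as Microsoft Excel, Open Office, and Google Docs.
Exporting data from a MySQL database as CSV is useful because you can then analyze and format the data however you need.
MySQL provides a simple way to export a query result to a CSV file located on the database server.
Before exporting, make sure that:
- The MySQL server process has write access to the folder that will contain the target CSV file.
- The target CSV file does not already exist.
# Example: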
SELECT * from customers
WHERE city='NYC'
INTO OUTFILE 'E:/Learning/MySQL/customer.csv'
FIELDS TERMINATED BY ',' ENCLOSED BY '"'
LINES TERMINATED BY '\r\n';
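- Common export failure:
- [Err] 1290 - The MySQL server is running with the --secure-file-priv option so it cannot execute this statement
According to the official documentation, the secure_file_priv variable restricts the directory that LOAD DATA, SELECT ... INTO OUTFILE, and LOAD_FILE() may read from or write to. Check its value with SHOW VARIABLES LIKE 'secure_file_priv'; if the value is NULL (the default on some installations), import and export are disabled.
Fix: open my.cnf or my.ini, add secure_file_priv='' under the [mysqld] section, and restart MySQL. An empty value removes the directory restriction entirely.
For an analysis of the problem, see: https://blog.youkuaiyun.com/fdipzone/article/details/78634992
- Other common problems: a roundup of MySQL import/export file issues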
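1.2 Import the CSV file back into the database
Reference: https://www.yiibai.com/mysql/import-csv-file-mysql-table.html
The LOAD DATA INFILE statement reads data from a text file and loads it into a database table quickly.
Before importing a file, prepare the following:
- A database table into which the file's data will be imported.
- A CSV file whose data matches the table's number of columns and the data type of each column.
- An account connecting to the MySQL database server that has the FILE and INSERT privileges.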
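# Example:
LOAD DATA INFILE 'E:/Learning/MySQL/customer.csv'
INTO TABLE customers1
FIELDS TERMINATED BY ','
ENCLOSED BY '"'
LINES TERMINATED BY '\r\n'
IGNORE 1 ROWS;
- Common error:
[Err] 1366 - Incorrect decimal value: '138500.00" "181' for column 'creditLimit' at row 1
This error typically appears when LINES TERMINATED BY does not match the file. The file exported in 1.1 ends each line with '\r\n', so reading it with '\n' merges the end of one row into the start of the next. IGNORE 1 ROWS skips a header line; omit it if the file has none, as is the case for the export in 1.1.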
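One likely cause of a merged value like the one in this error is a line-terminator mismatch between export and import. The sketch below is not MySQL code; it uses Python's csv module to write two hypothetical rows in the same layout as the export statement (every field quoted, comma-separated, '\r\n' line endings) and then shows what is left over when the text is split on '\n' only.

```python
import csv
import io

# Hypothetical rows standing in for the exported customers table.
rows = [(103, "Atelier graphique", "21000.00"),
        (112, "Signal Gift Stores", "71800.00")]

# Write the rows the way the export statement does: every field quoted,
# comma-separated, each line ending in '\r\n'.
buf = io.StringIO(newline="")
writer = csv.writer(buf, delimiter=",", quotechar='"',
                    quoting=csv.QUOTE_ALL, lineterminator="\r\n")
writer.writerows(rows)
data = buf.getvalue()

# Splitting on '\n' only leaves a stray '\r' attached to the last field.
wrong = [line for line in data.split("\n") if line]
# Splitting on the terminator that was actually written gives clean lines.
right = [line for line in data.split("\r\n") if line]

print(repr(wrong[0]))
print(repr(right[0]))
```

The stray '\r' after the closing quote is what keeps a reader that expects '\n' from recognizing the end of the last field, so the terminator used for import should match the one used for export.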
II. Exercises
Project 7: Highest-paid employee in each department (difficulty: medium)
Create an Employee table that holds all employees. Each employee has an Id, a salary, and a department Id.
+----+-------+--------+--------------+
| Id | Name  | Salary | DepartmentId |
+----+-------+--------+--------------+
| 1  | Joe   | 70000  | 1            |
| 2  | Henry | 80000  | 2            |
| 3  | Sam   | 60000  | 2            |
| 4  | Max   | 90000  | 1            |
+----+-------+--------+--------------+
Create a Department table that holds all departments of the company.
+----+----------+
| Id | Name     |
+----+----------+
| 1  | IT       |
| 2  | Sales    |
+----+----------+
Write a SQL query that finds the highest-paid employee in each department. For the tables above, Max has the highest salary in the IT department and Henry has the highest salary in the Sales department.
+------------+----------+--------+
| Department | Employee | Salary |
+------------+----------+--------+
| IT         | Max      | 90000  |
| Sales      | Henry    | 80000  |
+------------+----------+--------+
# Create the Employee table
mysql> CREATE TABLE Employee (
-> Id INT NOT NULL PRIMARY KEY,
-> Name VARCHAR(50) NULL,
-> Salary INT NULL,
-> DepartmentId INT NOT NULL
-> );
Query OK, 0 rows affected (0.24 sec)
# Insert data
mysql> INSERT INTO Employee (Id, Name, Salary, DepartmentId) VALUES (1, "Joe", 70000, 1);
Query OK, 1 row affected (0.05 sec)
mysql> INSERT INTO Employee (Id, Name, Salary, DepartmentId) VALUES (2, "Henry", 80000, 2);
Query OK, 1 row affected (0.19 sec)
mysql> INSERT INTO Employee (Id, Name, Salary, DepartmentId) VALUES (3, "Sam", 60000, 2);
Query OK, 1 row affected (0.04 sec)
mysql> INSERT INTO Employee (Id, Name, Salary, DepartmentId) VALUES (4, "Max", 90000, 1);
Query OK, 1 row affected (0.04 sec)
# Create the Department table
mysql> CREATE TABLE Department (
-> Id INT NOT NULL PRIMARY KEY,
-> Name VARCHAR(50) NULL
-> );
Query OK, 0 rows affected (0.29 sec)
# Insert data
mysql> INSERT INTO Department (Id, Name) VALUES (1, "IT");
Query OK, 1 row affected (0.07 sec)
mysql> INSERT INTO Department (Id, Name) VALUES (2, "Sales");
Query OK, 1 row affected (0.05 sec)
# Query and result
mysql> SELECT
-> d.Name AS Department,
-> e.Name AS Employee,
-> e.Salary AS Salary
-> FROM
-> Employee AS e,
-> Department AS d
-> WHERE e.DepartmentId = d.Id AND
-> e.Salary >= (SELECT MAX(Salary) FROM Employee WHERE DepartmentId=d.Id)
-> ORDER BY Salary DESC;
+------------+----------+--------+
| Department | Employee | Salary |
+------------+----------+--------+
| IT         | Max      |  90000 |
| Sales      | Henry    |  80000 |
+------------+----------+--------+
2 rows in set (0.14 sec)
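As an alternative to the correlated subquery, the per-department maximum can be computed once in a derived table and joined back. The sketch below runs that SQL through Python's built-in sqlite3 module purely so that it works without a MySQL server; the alias names m and MaxSalary are made up for illustration, and the SELECT itself uses only syntax that MySQL should also accept.

```python
import sqlite3

# In-memory SQLite database standing in for the MySQL server.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Employee (Id INT PRIMARY KEY, Name VARCHAR(50),
                       Salary INT, DepartmentId INT NOT NULL);
CREATE TABLE Department (Id INT PRIMARY KEY, Name VARCHAR(50));
INSERT INTO Employee VALUES (1, 'Joe', 70000, 1), (2, 'Henry', 80000, 2),
                            (3, 'Sam', 60000, 2), (4, 'Max', 90000, 1);
INSERT INTO Department VALUES (1, 'IT'), (2, 'Sales');
""")

# Derived table m holds one row per department with its highest salary;
# joining on both columns keeps every employee who earns that maximum.
query = """
SELECT d.Name AS Department, e.Name AS Employee, e.Salary
FROM Employee AS e
JOIN Department AS d ON e.DepartmentId = d.Id
JOIN (SELECT DepartmentId, MAX(Salary) AS MaxSalary
      FROM Employee
      GROUP BY DepartmentId) AS m
  ON m.DepartmentId = e.DepartmentId AND m.MaxSalary = e.Salary
ORDER BY e.Salary DESC
"""
rows = conn.execute(query).fetchall()
print(rows)
```

If two employees in a department share the top salary, both are returned, which matches the behavior of the subquery version.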
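Project 8: Swapping seats (difficulty: medium)
Xiaomei teaches information technology at a middle school. She keeps a seat table that stores each student's name and the corresponding seat id.
The id column increases consecutively.
Xiaomei wants to swap the seats of every two adjacent students.
Can you write a SQL query that outputs the result she wants?
Create the seat table as shown below:
Example:
+---------+---------+
| id      | student |
+---------+---------+
| 1       | Abbot   |
| 2       | Doris   |
| 3       | Emerson |
| 4       | Green   |
| 5       | Jeames  |
+---------+---------+
If the input is the table above, the output is:
+---------+---------+
| id      | student |
+---------+---------+
| 1       | Doris   |
| 2       | Abbot   |
| 3       | Green   |
| 4       | Emerson |
| 5       | Jeames  |
+---------+---------+
Note:
If the number of students is odd, the last student's seat does not change.
# Create the seat table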
mysql> CREATE TABLE seat (
-> id INT NOT NULL PRIMARY KEY,
-> student VARCHAR(50) NULL
-> );
Query OK, 0 rows affected (0.28 sec)
# Insert data
mysql> INSERT INTO seat (id, student) VALUES (1, "Abbot");
Query OK, 1 row affected (0.06 sec)
mysql> INSERT INTO seat (id, student) VALUES (2, "Doris");
Query OK, 1 row affected (0.05 sec)
mysql> INSERT INTO seat (id, student) VALUES (3, "Emerson");
Query OK, 1 row affected (0.04 sec)
mysql> INSERT INTO seat (id, student) VALUES (4, "Green");
Query OK, 1 row affected (0.09 sec)
mysql> INSERT INTO seat (id, student) VALUES (5, "Jeames");
Query OK, 1 row affected (0.06 sec)
# Query and result
mysql> SELECT * FROM (
-> SELECT id,student FROM seat WHERE id%2=1 AND id= (SELECT count(*) FROM seat)
-> UNION
-> SELECT id+1,student FROM seat WHERE id%2=1 AND id!=(SELECT count(*) FROM seat)
-> UNION
-> SELECT id-1,student FROM seat WHERE id%2=0) s1 ORDER BY id;
+----+---------+
| id | student |
+----+---------+
|  1 | Doris   |
|  2 | Abbot   |
|  3 | Green   |
|  4 | Emerson |
|  5 | Jeames  |
+----+---------+
5 rows in set (0.01 sec)
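The three UNION branches can also be folded into a single CASE expression that computes the new seat id for each row. This is a sketch of that variant, run through Python's sqlite3 module so it needs no MySQL server; the alias new_id is made up for illustration, and the sketch compares against MAX(id) instead of COUNT(*), which is equivalent when ids run consecutively from 1.

```python
import sqlite3

# In-memory SQLite database standing in for the MySQL server.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE seat (id INT PRIMARY KEY, student VARCHAR(50));
INSERT INTO seat VALUES (1, 'Abbot'), (2, 'Doris'), (3, 'Emerson'),
                        (4, 'Green'), (5, 'Jeames');
""")

# Odd id that is also the last row: stays in place.
# Any other odd id: moves one seat forward. Even id: moves one seat back.
query = """
SELECT CASE
         WHEN id % 2 = 1 AND id = (SELECT MAX(id) FROM seat) THEN id
         WHEN id % 2 = 1 THEN id + 1
         ELSE id - 1
       END AS new_id,
       student
FROM seat
ORDER BY new_id
"""
rows = conn.execute(query).fetchall()
print(rows)
```

A single pass over the table replaces three separate scans, and the logic for the odd last row sits next to the other two cases.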
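Project 9: Rank scores (difficulty: medium)
Write a SQL query that ranks scores. If two scores are equal, they share the same rank. Note that the rank following a tie should be the next consecutive integer; in other words, there should be no gaps between ranks.
Create the following score table:
+----+-------+
| Id | Score |
+----+-------+
| 1  | 3.50  |
| 2  | 3.65  |
| 3  | 4.00  |
| 4  | 3.85  |
| 5  | 4.00  |
| 6  | 3.65  |
+----+-------+
For example, given the score table above, your query should return the following (ordered from highest to lowest score):
+-------+------+
| Score | Rank |
+-------+------+
| 4.00  | 1    |
| 4.00  | 1    |
| 3.85  | 2    |
| 3.65  | 3    |
| 3.65  | 3    |
| 3.50  | 4    |
+-------+------+
# Create the table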
mysql> CREATE TABLE score (
-> Id INT NOT NULL PRIMARY KEY,
-> Score float(6,3) NULL
-> );
Query OK, 0 rows affected (0.31 sec)
# Insert data
mysql> INSERT INTO score (Id, Score) VALUES (1, 3.50);
Query OK, 1 row affected (0.06 sec)
mysql> INSERT INTO score (Id, Score) VALUES (2, 3.65);
Query OK, 1 row affected (0.07 sec)
mysql> INSERT INTO score (Id, Score) VALUES (3, 4.00);
Query OK, 1 row affected (0.06 sec)
mysql> INSERT INTO score (Id, Score) VALUES (4, 3.85);
Query OK, 1 row affected (0.06 sec)
mysql> INSERT INTO score (Id, Score) VALUES (5, 4.00);
Query OK, 1 row affected (0.04 sec)
mysql> INSERT INTO score (Id, Score) VALUES (6, 3.65);
Query OK, 1 row affected (0.06 sec)
# Query and result (Rank is a reserved word in MySQL 8.0, so the alias is quoted)
mysql> SELECT Score, (SELECT COUNT(DISTINCT score) FROM score WHERE score >= s.score) AS 'Rank'
-> from score as s
-> ORDER BY Score DESC;
+-------+------+
| Score | Rank |
+-------+------+
|  4.00 |    1 |
|  4.00 |    1 |
|  3.85 |    2 |
|  3.65 |    3 |
|  3.65 |    3 |
|  3.50 |    4 |
+-------+------+
6 rows in set (0.01 sec)
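Gap-free ranking is what the DENSE_RANK() window function provides, so the correlated COUNT(DISTINCT ...) can be swapped for it where window functions are available (MySQL 8.0 or newer). The sketch below is an assumption-laden stand-in: it runs through Python's sqlite3 module, which needs SQLite 3.25 or newer for window functions, and the alias score_rank is made up to avoid the reserved word.

```python
import sqlite3

# In-memory SQLite database standing in for the MySQL server.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE score (Id INT PRIMARY KEY, Score REAL);
INSERT INTO score VALUES (1, 3.50), (2, 3.65), (3, 4.00),
                         (4, 3.85), (5, 4.00), (6, 3.65);
""")

# DENSE_RANK gives tied scores the same rank and continues with the
# next consecutive integer, so no gaps appear after a tie.
query = """
SELECT Score, DENSE_RANK() OVER (ORDER BY Score DESC) AS score_rank
FROM score
ORDER BY Score DESC
"""
rows = conn.execute(query).fetchall()
ranks = [rank for _, rank in rows]
print(rows)
```

Compared with the subquery, the window function ranks all rows in one pass instead of counting distinct scores once per row.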
III. MySQL in Practice: Complex Projects
Project 10: Trips and users (difficulty: hard)
[Based on online tutorials]
The Trips table stores all taxi trips. Each trip has a unique key Id; Client_Id and Driver_Id are foreign keys to Users_Id in the Users table. Status is an enum with the members ('completed', 'cancelled_by_driver', 'cancelled_by_client').
+----+-----------+-----------+---------+--------------------+----------+
| Id | Client_Id | Driver_Id | City_Id | Status             |Request_at|
+----+-----------+-----------+---------+--------------------+----------+
| 1  | 1         | 10        | 1       | completed          |2013-10-01|
| 2  | 2         | 11        | 1       | cancelled_by_driver|2013-10-01|
| 3  | 3         | 12        | 6       | completed          |2013-10-01|
| 4  | 4         | 13        | 6       | cancelled_by_client|2013-10-01|
| 5  | 1         | 10        | 1       | completed          |2013-10-02|
| 6  | 2         | 11        | 6       | completed          |2013-10-02|
| 7  | 3         | 12        | 6       | completed          |2013-10-02|
| 8  | 2         | 12        | 12      | completed          |2013-10-03|
| 9  | 3         | 10        | 12      | completed          |2013-10-03|
| 10 | 4         | 13        | 12      | cancelled_by_driver|2013-10-03|
+----+-----------+-----------+---------+--------------------+----------+
The Users table stores all users. Each user has a unique key Users_Id. Banned indicates whether the user is banned, and Role is an enum with the members ('client', 'driver', 'partner').
+----------+--------+--------+
| Users_Id | Banned | Role   |
+----------+--------+--------+
| 1        | No     | client |
| 2        | Yes    | client |
| 3        | No     | client |
| 4        | No     | client |
| 10       | No     | driver |
| 11       | No     | driver |
| 12       | No     | driver |
| 13       | No     | driver |
+----------+--------+--------+
Write a SQL statement that finds the cancellation rate of non-banned users between October 1, 2013 and October 3, 2013. For the tables above, the statement should return the following result, with the cancellation rate (Cancellation Rate) rounded to two decimal places.
+------------+-------------------+
| Day        | Cancellation Rate |
+------------+-------------------+
| 2013-10-01 | 0.33              |
| 2013-10-02 | 0.00              |
| 2013-10-03 | 0.50              |
+------------+-------------------+
# Create the Trips table
mysql> CREATE TABLE Trips (
-> Id INT NOT NULL PRIMARY KEY,
-> Client_Id INT NOT NULL,
-> Driver_Id INT NOT NULL,
-> City_Id INT NOT NULL,
-> Status ENUM('completed', 'cancelled_by_driver', 'cancelled_by_client') NULL,
-> Request_at VARCHAR(50) NULL
-> );
Query OK, 0 rows affected (0.33 sec)
# Insert data
mysql> INSERT INTO Trips (Id, Client_Id, Driver_Id, City_Id, Status, Request_at) VALUES ('1', '1', '10', '1', 'completed', '2013-10-01');
Query OK, 1 row affected (0.04 sec)
mysql> INSERT INTO Trips (Id, Client_Id, Driver_Id, City_Id, Status, Request_at) VALUES ('2', '2', '11', '1', 'cancelled_by_driver', '2013-10-01');
Query OK, 1 row affected (0.08 sec)
mysql> INSERT INTO Trips (Id, Client_Id, Driver_Id, City_Id, Status, Request_at) VALUES ('3', '3', '12', '6', 'completed', '2013-10-01');
Query OK, 1 row affected (0.04 sec)
mysql> INSERT INTO Trips (Id, Client_Id, Driver_Id, City_Id, Status, Request_at) VALUES ('4', '4', '13', '6', 'cancelled_by_client', '2013-10-01');
Query OK, 1 row affected (0.06 sec)
mysql> INSERT INTO Trips (Id, Client_Id, Driver_Id, City_Id, Status, Request_at) VALUES ('5', '1', '10', '1', 'completed', '2013-10-02');
Query OK, 1 row affected (0.05 sec)
mysql> INSERT INTO Trips (Id, Client_Id, Driver_Id, City_Id, Status, Request_at) VALUES ('6', '2', '11', '6', 'completed', '2013-10-02');
Query OK, 1 row affected (0.13 sec)
mysql> INSERT INTO Trips (Id, Client_Id, Driver_Id, City_Id, Status, Request_at) VALUES ('7', '3', '12', '6', 'completed', '2013-10-02');
Query OK, 1 row affected (0.04 sec)
mysql> INSERT INTO Trips (Id, Client_Id, Driver_Id, City_Id, Status, Request_at) VALUES ('8', '2', '12', '12', 'completed', '2013-10-03');
Query OK, 1 row affected (0.06 sec)
mysql> INSERT INTO Trips (Id, Client_Id, Driver_Id, City_Id, Status, Request_at) VALUES ('9', '3', '10', '12', 'completed', '2013-10-03');
Query OK, 1 row affected (0.05 sec)
mysql> INSERT INTO Trips (Id, Client_Id, Driver_Id, City_Id, Status, Request_at) VALUES ('10', '4', '13', '12', 'cancelled_by_driver', '2013-10-03');
Query OK, 1 row affected (0.05 sec)
# Create the Users table
mysql> CREATE TABLE Users (
-> Users_Id INT NOT NULL PRIMARY KEY,
-> Banned VARCHAR(50) NULL,
-> Role ENUM('client', 'driver', 'partner') NULL
-> );
Query OK, 0 rows affected (0.32 sec)
# Insert data
mysql> INSERT INTO Users (Users_Id, Banned, Role) VALUES ('1', 'No', 'client');
Query OK, 1 row affected (0.05 sec)
mysql> INSERT INTO Users (Users_Id, Banned, Role) VALUES ('2', 'Yes', 'client');
Query OK, 1 row affected (0.16 sec)
mysql> INSERT INTO Users (Users_Id, Banned, Role) VALUES ('3', 'No', 'client');
Query OK, 1 row affected (0.12 sec)
mysql> INSERT INTO Users (Users_Id, Banned, Role) VALUES ('4', 'No', 'client');
Query OK, 1 row affected (0.14 sec)
mysql> INSERT INTO Users (Users_Id, Banned, Role) VALUES ('10', 'No', 'driver');
Query OK, 1 row affected (0.06 sec)
mysql> INSERT INTO Users (Users_Id, Banned, Role) VALUES ('11', 'No', 'driver');
Query OK, 1 row affected (0.18 sec)
mysql> INSERT INTO Users (Users_Id, Banned, Role) VALUES ('12', 'No', 'driver');
Query OK, 1 row affected (0.05 sec)
mysql> INSERT INTO Users (Users_Id, Banned, Role) VALUES ('13', 'No', 'driver');
Query OK, 1 row affected (0.06 sec)
# Query and result
mysql> SELECT t.Request_at AS 'Day',
-> ROUND(SUM(CASE WHEN t.Status LIKE 'cancelled%' THEN 1 ELSE 0 END)/COUNT(*),2) AS 'Cancellation Rate'
-> FROM Trips AS t INNER JOIN Users AS u
-> ON u.Users_Id = t.Client_Id AND u.Banned = 'No'
-> WHERE t.Request_at BETWEEN '2013-10-01' AND '2013-10-03'
-> GROUP BY t.Request_at;
+------------+-------------------+
| Day        | Cancellation Rate |
+------------+-------------------+
| 2013-10-01 |              0.33 |
| 2013-10-02 |              0.00 |
| 2013-10-03 |              0.50 |
+------------+-------------------+
3 rows in set (0.00 sec)
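The task speaks of non-banned users in general, and a stricter reading also excludes trips whose driver is banned. The sketch below joins Users twice, once for the client and once for the driver, and restricts the dates to the requested range. It runs through Python's sqlite3 module as a stand-in for MySQL, so the ENUM columns become TEXT, and it uses AVG over 1.0/0.0 instead of SUM/COUNT because SQLite would otherwise perform integer division. The aliases c, d, and CancellationRate are made up for illustration.

```python
import sqlite3

# In-memory SQLite database standing in for the MySQL server.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Trips (Id INT PRIMARY KEY, Client_Id INT, Driver_Id INT,
                    City_Id INT, Status TEXT, Request_at TEXT);
CREATE TABLE Users (Users_Id INT PRIMARY KEY, Banned TEXT, Role TEXT);
INSERT INTO Trips VALUES
  (1, 1, 10, 1, 'completed', '2013-10-01'),
  (2, 2, 11, 1, 'cancelled_by_driver', '2013-10-01'),
  (3, 3, 12, 6, 'completed', '2013-10-01'),
  (4, 4, 13, 6, 'cancelled_by_client', '2013-10-01'),
  (5, 1, 10, 1, 'completed', '2013-10-02'),
  (6, 2, 11, 6, 'completed', '2013-10-02'),
  (7, 3, 12, 6, 'completed', '2013-10-02'),
  (8, 2, 12, 12, 'completed', '2013-10-03'),
  (9, 3, 10, 12, 'completed', '2013-10-03'),
  (10, 4, 13, 12, 'cancelled_by_driver', '2013-10-03');
INSERT INTO Users VALUES
  (1, 'No', 'client'), (2, 'Yes', 'client'), (3, 'No', 'client'),
  (4, 'No', 'client'), (10, 'No', 'driver'), (11, 'No', 'driver'),
  (12, 'No', 'driver'), (13, 'No', 'driver');
""")

# A trip counts only if both its client and its driver are not banned.
# AVG of a 1.0/0.0 flag is the share of cancelled trips per day.
query = """
SELECT t.Request_at AS Day,
       ROUND(AVG(CASE WHEN t.Status LIKE 'cancelled%' THEN 1.0 ELSE 0.0 END), 2)
         AS CancellationRate
FROM Trips AS t
JOIN Users AS c ON c.Users_Id = t.Client_Id AND c.Banned = 'No'
JOIN Users AS d ON d.Users_Id = t.Driver_Id AND d.Banned = 'No'
WHERE t.Request_at BETWEEN '2013-10-01' AND '2013-10-03'
GROUP BY t.Request_at
ORDER BY t.Request_at
"""
rows = conn.execute(query).fetchall()
print(rows)
```

With the sample data no driver is banned, so the extra join does not change the rates; it only matters once a banned driver appears in Users.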
Project 11: Top 3 highest-paid employees in each department (difficulty: medium)
Empty the Employee table from Project 7 and insert the following data again (in effect, rows 5 and 6 are added):
+----+-------+--------+--------------+
| Id | Name  | Salary | DepartmentId |
+----+-------+--------+--------------+
| 1  | Joe   | 70000  | 1            |
| 2  | Henry | 80000  | 2            |
| 3  | Sam   | 60000  | 2            |
| 4  | Max   | 90000  | 1            |
| 5  | Janet | 69000  | 1            |
| 6  | Randy | 85000  | 1            |
+----+-------+--------+--------------+
Write a SQL query that finds the employees with the three highest salaries in each department. For the table above, the query should return:
+------------+----------+--------+
| Department | Employee | Salary |
+------------+----------+--------+
| IT         | Max      | 90000  |
| IT         | Randy    | 85000  |
| IT         | Joe      | 70000  |
| Sales      | Henry    | 80000  |
| Sales      | Sam      | 60000  |
+------------+----------+--------+
In addition, consider how to generalize this to the top N salaries in each department.
# Create the Employee table
mysql> CREATE TABLE IF NOT EXISTS Employee (
-> Id INT NOT NULL,
-> Name VARCHAR(50) NULL,
-> Salary INT NULL,
-> DepartmentId INT NOT NULL
-> );
Query OK, 0 rows affected, 1 warning (0.03 sec)
# Insert data
mysql> TRUNCATE TABLE Employee;
Query OK, 0 rows affected (0.42 sec)
mysql> INSERT INTO Employee (Id, Name, Salary, DepartmentId) VALUES ('1', 'Joe', '70000', '1');
Query OK, 1 row affected (0.05 sec)
mysql> INSERT INTO Employee (Id, Name, Salary, DepartmentId) VALUES ('2', 'Henry', '80000', '2');
Query OK, 1 row affected (0.05 sec)
mysql> INSERT INTO Employee (Id, Name, Salary, DepartmentId) VALUES ('3', 'Sam', '60000', '2');
Query OK, 1 row affected (0.05 sec)
mysql> INSERT INTO Employee (Id, Name, Salary, DepartmentId) VALUES ('4', 'Max', '90000', '1');
Query OK, 1 row affected (0.04 sec)
mysql> INSERT INTO Employee (Id, Name, Salary, DepartmentId) VALUES ('5', 'Janet', '69000', '1');
Query OK, 1 row affected (0.09 sec)
mysql> INSERT INTO Employee (Id, Name, Salary, DepartmentId) VALUES ('6', 'Randy', '85000', '1');
Query OK, 1 row affected (0.06 sec)
# Create the Department table
mysql> CREATE TABLE IF NOT EXISTS Department (
-> Id INT NOT NULL,
-> Name VARCHAR(50)
-> );
Query OK, 0 rows affected, 1 warning (0.07 sec)
# Insert data
mysql> TRUNCATE TABLE Department;
Query OK, 0 rows affected (0.47 sec)
mysql> INSERT INTO Department (Id, Name) VALUES ('1', 'IT');
Query OK, 1 row affected (0.06 sec)
mysql> INSERT INTO Department (Id, Name) VALUES ('2', 'Sales');
Query OK, 1 row affected (0.04 sec)
# Query and result
mysql> SELECT Department.name AS department, e1.name AS employee, e1.salary AS salary
-> FROM Employee e1
-> JOIN Department ON e1.departmentid = Department.id
-> WHERE ( SELECT COUNT(DISTINCT e2.salary)
-> FROM Employee e2
-> WHERE e2.salary > e1.salary AND e1.departmentid = e2.departmentid ) < 3
-> ORDER BY Department.name, e1.salary DESC;
+------------+----------+--------+
| department | employee | salary |
+------------+----------+--------+
| IT         | Max      |  90000 |
| IT         | Randy    |  85000 |
| IT         | Joe      |  70000 |
| Sales      | Henry    |  80000 |
| Sales      | Sam      |  60000 |
+------------+----------+--------+
5 rows in set (0.00 sec)
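For the top-N variant, the same correlated subquery works if the constant 3 becomes a parameter: an employee is kept when fewer than N distinct salaries in the same department are higher. The sketch below wraps that in a hypothetical helper, top_n_per_department, and runs it through Python's sqlite3 module as a stand-in for MySQL. The ? placeholder is sqlite3's syntax; a MySQL driver would use its own placeholder style.

```python
import sqlite3

# In-memory SQLite database standing in for the MySQL server.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Employee (Id INT, Name VARCHAR(50), Salary INT, DepartmentId INT);
CREATE TABLE Department (Id INT, Name VARCHAR(50));
INSERT INTO Employee VALUES (1, 'Joe', 70000, 1), (2, 'Henry', 80000, 2),
                            (3, 'Sam', 60000, 2), (4, 'Max', 90000, 1),
                            (5, 'Janet', 69000, 1), (6, 'Randy', 85000, 1);
INSERT INTO Department VALUES (1, 'IT'), (2, 'Sales');
""")


def top_n_per_department(connection, n):
    """Return employees whose salary is among the n highest distinct
    salaries of their department (hypothetical helper for illustration)."""
    query = """
    SELECT d.Name, e1.Name, e1.Salary
    FROM Employee AS e1
    JOIN Department AS d ON e1.DepartmentId = d.Id
    WHERE (SELECT COUNT(DISTINCT e2.Salary)
           FROM Employee AS e2
           WHERE e2.DepartmentId = e1.DepartmentId
             AND e2.Salary > e1.Salary) < ?
    ORDER BY d.Name, e1.Salary DESC
    """
    return connection.execute(query, (n,)).fetchall()


top3 = top_n_per_department(conn, 3)
top1 = top_n_per_department(conn, 1)
print(top3)
print(top1)
```

Because the subquery counts distinct salaries, employees who tie on a salary share a position and are all returned, so a department can yield more than N rows.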
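Project 12: Rank scores with gaps (difficulty: medium)
Using the same score table as yesterday, implement ranking again, but this time the ranks are not consecutive, as shown below:
+-------+------+
| Score | Rank |
+-------+------+
| 4.00  | 1    |
| 4.00  | 1    |
| 3.85  | 3    |
| 3.65  | 4    |
| 3.65  | 4    |
| 3.50  | 6    |
+-------+------+
# Query and result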
mysql> SELECT Score, (SELECT COUNT(*) FROM score AS s2 WHERE s2.Score > s1.Score)+1 AS 'Rank'
-> FROM score AS s1
-> ORDER BY Score DESC;
+-------+------+
| Score | Rank |
+-------+------+
| 4.000 |    1 |
| 4.000 |    1 |
| 3.850 |    3 |
| 3.650 |    4 |
| 3.650 |    4 |
| 3.500 |    6 |
+-------+------+
6 rows in set (0.00 sec)
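Ranking with gaps is what the RANK() window function computes: a row's rank is one plus the number of rows with a strictly higher score, which is the same rule as the COUNT(*) + 1 subquery. The sketch below assumes window-function support (MySQL 8.0 or newer; SQLite 3.25 or newer for this stand-in run through Python's sqlite3 module), and the alias score_rank is made up to avoid the reserved word.

```python
import sqlite3

# In-memory SQLite database standing in for the MySQL server.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE score (Id INT PRIMARY KEY, Score REAL);
INSERT INTO score VALUES (1, 3.50), (2, 3.65), (3, 4.00),
                         (4, 3.85), (5, 4.00), (6, 3.65);
""")

# RANK gives tied scores the same rank and then skips ahead by the
# number of tied rows, which produces the gaps 1, 1, 3, 4, 4, 6.
query = """
SELECT Score, RANK() OVER (ORDER BY Score DESC) AS score_rank
FROM score
ORDER BY Score DESC
"""
rows = conn.execute(query).fetchall()
ranks = [rank for _, rank in rows]
print(rows)
```

Switching between this and the gap-free ranking of Project 9 is then a matter of choosing RANK() or DENSE_RANK().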