2023-04-03 索引

1.索引

作用：加速查找和约束

在数据库中索引最核心的作用是：加速查找。例如：在含有300w条数据的表中查询，无索引需要700秒，而利用索引可能仅需1秒。
在开发过程中会为那些经常会被搜索的列创建索引，以提高程序的响应速度。例如：查询手机号、邮箱、用户名等。

1.1 索引原理

索引的底层是基于B+Tree的数据结构存储的。
B+Tree结构连接：https://www.cs.usfca.edu/~galles/visualization/BPlusTree.html

数据库的索引是基于上述B+Tree的数据结构实现，但在创建数据库表时，如果指定不同的引擎，底层使用的B+Tree结构的原理有些不同。

myisam引擎，非聚簇索引（数据和索引结构分开存储）
innodb引擎，聚簇索引（数据和主键索引结构存储在一起）

1.1.1 非聚簇索引（myisam引擎）

create table 表名(
    id int not null auto_increment primary key, 
    name varchar(32) not null,
    age int
)engine=myisam default charset=utf8;

image.png

下面存储的是数据的内存地址

1.1.2 聚簇索引（innodb引擎）

create table 表名(
    id int not null auto_increment primary key, 
    name varchar(32) not null,
    age int
)engine=innodb default charset=utf8;

image.png

在MySQL文件存储中的体现：

root@192 userdb # pwd
/usr/local/mysql/data/userdb
root@192 userdb # ls -l
total 1412928
-rw-r-----  1 _mysql  _mysql       8684 May 15 22:51 big.frm，表结构。
-rw-r-----  1 _mysql  _mysql  717225984 May 15 22:51 big.ibd，数据和索引结构。
-rw-r-----  1 _mysql  _mysql       8588 May 16 11:38 goods.frm
-rw-r-----  1 _mysql  _mysql      98304 May 16 11:39 goods.ibd
-rw-r-----  1 _mysql  _mysql       8586 May 26 10:57 t2.frm，表结构
-rw-r-----  1 _mysql  _mysql          0 May 26 10:57 t2.MYD，数据
-rw-r-----  1 _mysql  _mysql       1024 May 26 10:57 t2.MYI，索引结构

上述聚簇索引和非聚簇索引底层均利用了B+Tree结构结构，只不过内部数据存储有些不同罢了。

在企业开发中一般都会使用 innodb 引擎（内部支持事务、行级锁、外键等特点），在MySQL5.5版本之后默认引擎也是innodb。

mysql> show create table users \G;
*************************** 1. row ***************************
       Table: users
Create Table: CREATE TABLE `users` (
  `id` int(11) NOT NULL AUTO_INCREMENT,
  `name` varchar(32) DEFAULT NULL,
  `password` varchar(64) DEFAULT NULL,
  `ctime` datetime DEFAULT NULL,
  `age` int(11) DEFAULT '5',
  PRIMARY KEY (`id`)
) ENGINE=InnoDB AUTO_INCREMENT=11 DEFAULT CHARSET=utf8
1 row in set (0.00 sec)

ERROR:
No query specified

mysql> show index from users \G;
*************************** 1. row ***************************
        Table: users
   Non_unique: 0
     Key_name: PRIMARY
 Seq_in_index: 1
  Column_name: id
    Collation: A
  Cardinality: 3
     Sub_part: NULL
       Packed: NULL
         Null:
   Index_type: BTREE   -- 虽然显示BTree，但底层数据结构基于B+Tree。
      Comment:
Index_comment:
1 row in set (0.00 sec)

ERROR:
No query specified

image.png

1.2 常见索引

在innodb引擎下，索引底层都是基于B+Tree数据结构存储（聚簇索引）。

在开发过程中常见的索引类型有：(前面是单列，后面是多列）

主键索引：加速查找、不能为空、不能重复。 + 联合主键索引
唯一索引：加速查找、不能重复。 + 联合唯一索引
普通索引：加速查找。 + 联合索引

1.2.1 主键和联合主键索引

create table 表名(
    id int not null auto_increment primary key,   -- 主键
    name varchar(32) not null
);

create table 表名(
    id int not null auto_increment,
    name varchar(32) not null,
    primary key(id)
);

create table 表名(
    id int not null auto_increment,
    name varchar(32) not null,
    primary key(列1,列2)          -- 如果有多列，称为联合主键（不常用且myisam引擎支持）
);

alter table 表名 add primary key(列名);

alter table 表名 drop primary key;

注意：删除索引时可能会报错，自增列必须定义为键。

ERROR 1075 (42000): Incorrect table definition; there can be only one auto column and it must be defined as a key

alter table 表 change id id int not null;

create table t7(
    id int not null,
    name varchar(32) not null,
    primary key(id)
);

alter table t6 drop primary key;

1.2.2 唯一和联合唯一索引

create table 表名(
    id int not null auto_increment primary key,
    name varchar(32) not null,
    email varchar(64) not null,
    unique ix_name (name),
    unique ix_email (email),
);

create table 表名(
    id int not null auto_increment,
    name varchar(32) not null,
    unique (列1,列2)               -- 如果有多列，称为联合唯一索引。
);

create unique index 索引名 on 表名(列名);

drop unique index 索引名 on 表名;

1.2.3 索引和联合索引

create table 表名(
    id int not null auto_increment primary key,
    name varchar(32) not null,
    email varchar(64) not null,
    index ix_email (email),
    index ix_name (name),
);

create table 表名(
    id int not null auto_increment primary key,
    name varchar(32) not null,
    email varchar(64) not null,
    index ix_email (name,email)     -- 如果有多列，称为联合索引。
);

create index 索引名 on 表名(列名);

drop index 索引名 on 表名;

在项目开发的设计表结构的环节，大家需要根据业务需求的特点来决定是否创建相应的索引。

一般情况下，一张表只能有一个主键索引，id自增，有多个唯一索引，手机号，邮箱，用户名等，普通索引有多个，比如昵称。当对多列联合起来作为条件时，用联合唯一索引，比如一个用户只能对文章进行踩或赞，则可以将用户id和文章id联合唯一索引；一般情况下，我们需要用户名和密码放在一起查询，则可以将用户名和密码联合索引。

2.命中索引(面试）

操作表
在表中创建索引后，查询时一定要命中索引。
在数据库的表中创建索引之后优缺点如下：

优点：查找速度快、约束（唯一、主键、联合唯一）
缺点：插入、删除、更新速度比较慢，因为每次操作都需要调整整个B+Tree的数据结构关系。

所以，在表中不要无节制的去创建索引啊。。。
在开发中，我们会对表中经常被搜索的列创建索引，从而提高程序的响应速度。

CREATE TABLE `big` (
    `id` int(11) NOT NULL AUTO_INCREMENT,
    `name` varchar(32) DEFAULT NULL,
    `email` varchar(64) DEFAULT NULL,
    `password` varchar(64) DEFAULT NULL,
    `age` int(11) DEFAULT NULL,
    PRIMARY KEY (`id`),                       -- 主键索引
    UNIQUE KEY `big_unique_email` (`email`),  -- 唯一索引
    index `ix_name_pwd` (`name`,`password`)     -- 联合索引
) ENGINE=InnoDB DEFAULT CHARSET=utf8

一般情况下，我们针对只要通过索引列去搜搜都可以 命中 索引（通过索引结构加速查找）。

select * from big where id = 5;
select * from big where id > 5;
select * from big where email = "wupeiqi@live.com";
select * from big where name = "武沛齐";
select * from big where name = "kelly" and password="ffsijfs";
...

但是，还是会有一些特殊的情况，让我们无法命中索引（即使创建了索引），这也是需要大家在开发中要注意的。

类型不一致(表格中name是字符串，查询时name是整型）

select * from big where name = 123;       -- 未命中
select * from big where email = 123;  -- 未命中

特殊的主键：
  select * from big where id = "123"; -- 命中

使用不等于

select * from big where name != "武沛齐";                -- 未命中
select * from big where email != "wupeiqi@live.com";  -- 未命中

特殊的主键：
  select * from big where id != 123;  -- 命中

or，当or条件中有未建立索引的列才失效。

select * from big where id = 123 or password="xx";            -- 未命中，password未建立索引
select * from big where name = "wupeiqi" or password="xx";    -- 未命中
特别的：
  select * from big where id = 10 or password="xx" and name="xx"; -- 命中

排序，当根据索引排序时候，选择的映射如果不是索引，则不走索引。

select * from big order by name asc;     -- 未命中（select name 命中）
select * from big order by name desc;    -- 未命中

特别的主键：
  select * from big order by id desc;  -- 命中

like，模糊匹配时。

select * from big where name like "%u-12-19999";  -- 未命中
select * from big where name like "_u-12-19999";  -- 未命中
select * from big where name like "wu-%-10";      -- 未命中

特别的：
  select * from big where name like "wu-1111-%";  -- 命中
  select * from big where name like "wuw-%";      -- 命中

使用函数

select * from big where reverse(name) = "wupeiqi";  -- 未命中

特别的：
  select * from big where name = reverse("wupeiqi");  -- 命中

最左前缀，如果是联合索引，要遵循最左前缀原则。

--如果联合索引为：(name,password)
     name and password       -- 命中
     name                  -- 命中
     password                -- 未命中
     name or password          -- 未命中

常见的无法命中索引的情况就是上述的示例。

对于大家来说会现在的最大的问题是，记不住，怎么办呢？接下来看执行计划。

3.执行计划

MySQL中提供了执行计划，让你能够预判SQL的执行（只能给到一定的参考，不一定完全能预判准确）。

explain + SQL语句;

image.png

其中比较重要的是 type，他他SQL性能比较重要的标志，性能从低到高依次：all < index < range < index_merge < ref_or_null < ref < eq_ref < system/const

ALL，全表扫描，数据表从头到尾找一遍。(一般未命中索引，都是会执行全表扫描)

select * from big;

特别的：如果有limit，则找到之后就不在继续向下扫描.
  select * from big limit 1;

INDEX，全索引扫描，对索引从头到尾找一遍

explain select id from big;
explain select name from big;

RANGE，对索引列进行范围查找

explain select * from big where id > 10;
explain select * from big where id in (11,22,33);
explain select * from big where id between 10 and 20;
explain select * from big where name > "wupeiqi" ;

INDEX_MERGE，合并索引，使用多个单列索引搜索

explain select * from big where id = 10 or name="武沛齐";

REF，根据索引直接去查找（非键）。
```
select *  from big where name = '武沛齐';
```

EQ_REF，连表操作时常见。

explain select big.name,users.id from big left join users on big.age = users.id;

CONST，常量，表最多有一个匹配行,因为仅有一行,在这行的列值可被优化器剩余部分认为是常数,const表很快。

explain select * from big where id=11;                    -- 主键
explain select * from big where email="w-11-0@qq.com";    -- 唯一索引

SYSTEM，系统，表仅有一行(=系统表)。这是const联接类型的一个特例。
```
 explain select * from (select * from big where id=1 limit 1) as A;
```

最后编辑于：2023.04.03 15:13:24

人面猴
序言：七十年代末，一起剥皮案震惊了整个滨河市，随后出现的几起案子，更是在滨河造成了极大的恐慌，老刑警刘岩，带你破解...
沈念sama阅读 200,667评论 5赞 472
死咒
序言：滨河连续发生了三起死亡事件，死亡现场离奇诡异，居然都是意外死亡，警方通过查阅死者的电脑和手机，发现死者居然都...
沈念sama阅读 84,361评论 2赞 377
救了他两次的神仙让他今天三更去死
文/潘晓璐我一进店门，熙熙楼的掌柜王于贵愁眉苦脸地迎上来，“玉大人，你说我怎么就摊上这事。” “怎么了？”我有些...
开封第一讲书人阅读 147,700评论 0赞 333
道士缉凶录：失踪的卖姜人
文/不坏的土叔我叫张陵，是天一观的道长。经常有香客问我，道长，这世上最难降的妖魔是什么？我笑而不...
开封第一讲书人阅读 54,027评论 1赞 272
港岛之恋（遗憾婚礼）
正文为了忘掉前任，我火速办了婚礼，结果婚礼上，老公的妹妹穿的比我还像新娘。我一直安慰自己，他们只是感情好，可当我...
茶点故事阅读 62,988评论 5赞 361
恶毒庶女顶嫁案：这布局不是一般人想出来的
文/花漫我一把揭开白布。她就那样静静地躺着，像睡着了一般。火红的嫁衣衬着肌肤如雪。梳的纹丝不乱的头发上，一...
开封第一讲书人阅读 48,230评论 1赞 277
城市分裂传说
那天，我揣着相机与录音，去河边找鬼。笑死，一个胖子当着我的面吹牛，可吹牛的内容都是我干的。我是一名探鬼主播，决...
沈念sama阅读 37,705评论 3赞 393
双鸳鸯连环套：你想象不到人心有多黑
文/苍兰香墨我猛地睁开眼，长吁一口气：“原来是场噩梦啊……” “哼！你这毒妇竟也来了？” 一声冷哼从身侧响起，我...
开封第一讲书人阅读 36,366评论 0赞 255
万荣杀人案实录
序言：老挝万荣一对情侣失踪，失踪者是张志新（化名）和其女友刘颖，没想到半个月后，有当地人在树林里发现了一具尸体，经...
沈念sama阅读 40,496评论 1赞 294
护林员之死
正文独居荒郊野岭守林人离奇死亡，尸身上长有42处带血的脓包…… 初始之章·张勋以下内容为张勋视角年9月15日...
茶点故事阅读 35,405评论 2赞 317
白月光启示录
正文我和宋清朗相恋三年，在试婚纱的时候发现自己被绿了。大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
茶点故事阅读 37,453评论 1赞 329
活死人
序言：一个原本活蹦乱跳的男人离奇死亡，死状恐怖，灵堂内的尸体忽然破棺而出，到底是诈尸还是另有隐情，我是刑警宁泽，带...
沈念sama阅读 33,126评论 3赞 315
日本核电站爆炸内幕
正文年R本政府宣布，位于F岛的核电站，受9级特大地震影响，放射性物质发生泄漏。R本人自食恶果不足惜，却给世界环境...
茶点故事阅读 38,725评论 3赞 303
男人毒药：我在死后第九天来索命
文/蒙蒙一、第九天我趴在偏房一处隐蔽的房顶上张望。院中可真热闹，春花似锦、人声如沸。这庄子的主人今日做“春日...
开封第一讲书人阅读 29,803评论 0赞 19
一桩弑父案，背后竟有这般阴谋
文/苍兰香墨我抬头看了看天上的太阳。三九已至，却和暖如春，着一层夹袄步出监牢的瞬间，已是汗流浃背。一阵脚步声响...
开封第一讲书人阅读 31,015评论 1赞 255
情欲美人皮
我被黑心中介骗来泰国打工，没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留，地道东北人。一个月前我还...
沈念sama阅读 42,514评论 2赞 346
代替公主和亲
正文我出身青楼，却偏偏与公主长得像，于是被迫代替她去往敌国和亲。传闻我的和亲对象是个残疾皇子，可洞房花烛夜当晚...
茶点故事阅读 42,111评论 2赞 341

2023-04-03 索引

1.索引

1.1 索引原理

1.1.1 非聚簇索引（myisam引擎）

1.1.2 聚簇索引（innodb引擎）

1.2 常见索引

1.2.1 主键和联合主键索引

1.2.2 唯一和联合唯一索引

1.2.3 索引和联合索引

2.命中索引(面试）

3.执行计划

推荐阅读更多精彩内容