
Optimization principles for IN & EXISTS and NOT IN & NOT EXISTS: a summary

December 12, 2017 | 移动技术网 IT编程


1. How EXISTS is executed
select * from t1 where exists ( select null from t2 where y = x )
can be understood as:

 
for x in ( select * from t1 )
loop
  if ( exists ( select null from t2 where y = x.x ) )
  then
    output the record
  end if
end loop

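Performance differences between IN and EXISTS:
As a rule of thumb, use IN when the subquery returns a small result set and the table in the outer query is large and indexed; conversely, use EXISTS when the outer query returns few rows and the table in the subquery is large and indexed.
What really separates IN from EXISTS is the driving order, and that is the key to the performance difference. With EXISTS, the outer table is the driving table and is accessed first; with IN, the subquery is executed first. The goal is for the driving table to return its rows quickly, which is why both the indexes and the sizes of the result sets matter.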
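The difference in driving order can be sketched in plain Python. This is only an illustration, not how a database engine is implemented: the lists `t1` and `t2` and the two helper functions are hypothetical stand-ins for the tables and the two evaluation strategies, and NULL handling is ignored.

```python
# Hypothetical in-memory stand-ins for tables t1(x) and t2(y).
t1 = [1, 2, 3, 4, 5]
t2 = [2, 4, 4, 6]


def exists_style(outer, inner):
    """Outer table drives: probe the inner table once per outer row."""
    result = []
    for x in outer:
        # any() stops at the first match, like EXISTS. A real database
        # would use an index on t2.y for this probe instead of a scan.
        if any(y == x for y in inner):
            result.append(x)
    return result


def in_style(outer, inner):
    """Subquery runs first: collect its distinct values, then filter the outer rows."""
    values = set(inner)  # the subquery result, evaluated once
    return [x for x in outer if x in values]


exists_rows = exists_style(t1, t2)
in_rows = in_style(t1, t2)
print(exists_rows, in_rows)
```

Both strategies return the same rows; what changes is which side is read first and how often the other side is probed, and that is where the sizes of the two row sets and the available indexes come in.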
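Also note that IN never matches NULL, for example:
select 1 from dual where null in (0,1,2,null)
returns no rows, because comparing NULL with any value, including another NULL, yields unknown rather than true.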
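The NULL behaviour can be checked with a small script. The sketch below uses Python's built-in sqlite3 module as a stand-in for Oracle (an assumption made for convenience); SQLite has no dual table, so the FROM clause is simply left out.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# NULL on the left side never matches, even though NULL is in the list.
null_rows = conn.execute("SELECT 1 WHERE NULL IN (0, 1, 2, NULL)").fetchall()

# An ordinary value that is present in the list still matches.
one_rows = conn.execute("SELECT 1 WHERE 1 IN (0, 1, 2, NULL)").fetchall()

# The comparison itself evaluates to NULL (unknown), not to false.
null_cmp = conn.execute("SELECT NULL IN (0, 1, 2, NULL)").fetchone()[0]

print(null_rows, one_rows, null_cmp)
```

The first query returns no rows and the third returns NULL (None in Python), which is the three-valued logic behind the empty result described above.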

2. NOT IN and NOT EXISTS:
How NOT EXISTS is executed


 
select ..... 
from rollup r 
where not exists ( select 'found' from title t 
where r.source_id = t.title_id); 

This can be understood as:


 
for x in ( select * from rollup )
loop
  if ( not exists ( that query ) ) then
    output
  end if;
end loop;

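Note: NOT EXISTS and NOT IN are not always interchangeable; it depends on the requirement. If the column selected by the subquery can be NULL, one cannot be substituted for the other.
For example, compare the results of the following statements: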
select x,y from t;
x      y
------ ------
1      3
3      1
1      2
1      1
3      1
5      null
select * from t where x not in (select y from t t2 )
no rows
select * from t where not exists (select null from t t2
where t2.y=t.x )
x      y
------ ------
5      null
So which form to use depends on the specific requirement.
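The example above can be reproduced with a short script. This is a sketch that uses Python's built-in sqlite3 module in place of Oracle (an assumption); the table t and its six rows are taken from the listing above, and the third query is an extra variant added here to show the effect of filtering out NULLs.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (x INTEGER, y INTEGER)")
conn.executemany(
    "INSERT INTO t VALUES (?, ?)",
    [(1, 3), (3, 1), (1, 2), (1, 1), (3, 1), (5, None)],
)

# The subquery returns a NULL, so NOT IN can never evaluate to true.
not_in_rows = conn.execute(
    "SELECT * FROM t WHERE x NOT IN (SELECT y FROM t t2)"
).fetchall()

# NOT EXISTS only asks whether a matching row exists, so NULLs do no harm.
not_exists_rows = conn.execute(
    "SELECT * FROM t WHERE NOT EXISTS (SELECT NULL FROM t t2 WHERE t2.y = t.x)"
).fetchall()

# Removing the NULLs from the subquery makes NOT IN agree with NOT EXISTS.
not_in_filtered = conn.execute(
    "SELECT * FROM t WHERE x NOT IN (SELECT y FROM t t2 WHERE y IS NOT NULL)"
).fetchall()

print(not_in_rows, not_exists_rows, not_in_filtered)
```

The single NULL in column y is enough to make the unfiltered NOT IN return nothing, while the other two queries return the row with x = 5.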
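Performance differences between NOT IN and NOT EXISTS:
Use NOT IN only when the column selected by the subquery has a NOT NULL constraint, or when the query otherwise guarantees that it is not NULL (for example with an IS NOT NULL predicate). In addition, if the table in the outer query is large and the table in the subquery is smaller but still returns many rows, use NOT IN together with a hash anti-join.
If the outer query returns few rows and the table in the subquery has many rows and an index, NOT EXISTS is a good choice. NOT IN is also best combined with the /*+ hash_aj */ hint, or rewritten as an outer join plus IS NULL.
NOT IN works better under the cost-based optimizer.
For example: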


 
select ..... 
from rollup r 
where not exists ( select 'found' from title t 
where r.source_id = t.title_id); 

can be rewritten as (better):
select ......
from title t, rollup r
where r.source_id = t.title_id(+)
and t.title_id is null;
or (better):
select ...
from rollup r
where r.source_id not in ( select /*+ hash_aj */ t.title_id
from title t
where t.title_id is not null )
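Note: the suggestions above are theoretical. The best practice is to start from them and then analyze the execution plan to find the best way to write the statement.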
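Before comparing execution plans, it helps to confirm that the rewrites return the same rows. The sketch below does this with Python's built-in sqlite3 module standing in for Oracle (an assumption): the sample rows are hypothetical, an ANSI LEFT JOIN replaces Oracle's (+) operator, and the optimizer hint is left out because SQLite has no equivalent.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE title  (title_id INTEGER);
    CREATE TABLE rollup (source_id INTEGER);
    INSERT INTO title  VALUES (1), (2), (NULL);
    INSERT INTO rollup VALUES (1), (2), (3), (4);
""")

queries = {
    "not_exists": """
        SELECT r.source_id FROM rollup r
        WHERE NOT EXISTS (SELECT 'found' FROM title t
                          WHERE r.source_id = t.title_id)
        ORDER BY r.source_id""",
    # ANSI LEFT JOIN stands in for Oracle's (+) outer-join operator.
    "outer_join": """
        SELECT r.source_id FROM rollup r
        LEFT JOIN title t ON r.source_id = t.title_id
        WHERE t.title_id IS NULL
        ORDER BY r.source_id""",
    # The IS NOT NULL filter keeps the NULL title_id from emptying the result.
    "not_in": """
        SELECT r.source_id FROM rollup r
        WHERE r.source_id NOT IN (SELECT t.title_id FROM title t
                                  WHERE t.title_id IS NOT NULL)
        ORDER BY r.source_id""",
}

results = {name: conn.execute(sql).fetchall() for name, sql in queries.items()}
for name, rows in results.items():
    print(name, rows)
```

With this data all three forms return source_id 3 and 4. A NULL source_id in rollup would still separate them: NOT EXISTS and the outer join return that row, while NOT IN does not.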
Summary of EXISTS and NOT EXISTS

1 exists
select * from anken_m where exists(
select my_list_temp_m.sales_code
from my_list_temp_m
where my_list_temp_m.sales_code=anken_m.sales_code)
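Explanation:
1) Finds the sales_code values that exist in both anken_m and my_list_temp_m.
2) sales_code is the primary key of anken_m and a foreign key in my_list_temp_m.
Notes:
1) The outer table anken_m is the table being queried.
2) The inner table my_list_temp_m supplies the condition.
3) The inner and outer queries here use different tables; if both read the same table, the two references must be told apart with table aliases.
4) anken_m appears in the correlation condition but does not need to be added to the FROM clause of the inner query.
5) The order of the two sides of the condition my_list_temp_m.sales_code=anken_m.sales_code does not affect the result.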

2 not exists
select * from anken_m where not exists(
select my_list_temp_m.sales_code
from my_list_temp_m
where my_list_temp_m.sales_code=anken_m.sales_code)
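Explanation:
1) Finds the sales_code values that exist in anken_m but not in my_list_temp_m.
2) sales_code is the primary key of anken_m and a foreign key in my_list_temp_m.
Notes:
1) The outer table anken_m is the table being queried.
2) The inner table my_list_temp_m supplies the condition.
3) The inner and outer queries here use different tables; if both read the same table, the two references must be told apart with table aliases.
4) anken_m appears in the correlation condition but does not need to be added to the FROM clause of the inner query.
5) The order of the two sides of the condition my_list_temp_m.sales_code=anken_m.sales_code does not affect the result.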
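The EXISTS and NOT EXISTS queries above split the rows of anken_m into two disjoint groups. The following sketch shows this with Python's built-in sqlite3 module standing in for Oracle (an assumption); the sales_code values are made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE anken_m (sales_code TEXT PRIMARY KEY);
    CREATE TABLE my_list_temp_m (sales_code TEXT REFERENCES anken_m(sales_code));
    INSERT INTO anken_m VALUES ('A001'), ('A002'), ('A003');
    -- 'A003' is listed twice on purpose.
    INSERT INTO my_list_temp_m VALUES ('A001'), ('A003'), ('A003');
""")

# The same correlated subquery is used for both conditions.
subquery = """(SELECT my_list_temp_m.sales_code FROM my_list_temp_m
               WHERE my_list_temp_m.sales_code = anken_m.sales_code)"""

in_both = conn.execute(
    f"SELECT sales_code FROM anken_m WHERE EXISTS {subquery} ORDER BY sales_code"
).fetchall()

only_in_anken_m = conn.execute(
    f"SELECT sales_code FROM anken_m WHERE NOT EXISTS {subquery} ORDER BY sales_code"
).fetchall()

print(in_both, only_in_anken_m)
```

Each anken_m row appears at most once in either result, even though 'A003' occurs twice in my_list_temp_m: EXISTS only tests for a match and does not multiply rows the way a join would.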

3 Combined usage
update anken_m
set(plan_type_code, branch_name, business_type_code)
=(select anken.plan_type_code,anken.branch_name,anken.business_type_code
from anken
where anken.sales_code=anken_m.sales_code)
where exists (
select anken.sales_code
from anken,my_list_temp_m
where my_list_temp_m.sales_code=anken.sales_code
and anken.sales_code=anken_m.sales_code
)
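Explanation:
1) Updates the rows of one table with data from another table.
2) Performs the batch update in a single SQL statement.
3) sales_code is the primary key of anken and anken_m, and a foreign key in my_list_temp_m.
Notes:
1) The columns listed in the SET clause must correspond one-to-one with the columns of the source query, and the condition of the source query must narrow the result to a single row. In other words, sales_code must uniquely identify one row in anken and one row in anken_m; this guarantees that the row being updated and the source row share the same primary key.
2) The WHERE EXISTS clause determines which source rows are used, that is, which rows of anken are used to update anken_m. For this reason anken_m does not need to be added to the FROM clause inside WHERE EXISTS. Rows of anken_m that fail the EXISTS test are left unchanged.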
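The effect of the WHERE EXISTS clause on the update can be seen on a small data set. This is a sketch that uses Python's built-in sqlite3 module in place of Oracle (an assumption; the multi-column SET form needs SQLite 3.15 or later), and all rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE anken (sales_code TEXT PRIMARY KEY, plan_type_code TEXT,
                        branch_name TEXT, business_type_code TEXT);
    CREATE TABLE anken_m (sales_code TEXT PRIMARY KEY, plan_type_code TEXT,
                          branch_name TEXT, business_type_code TEXT);
    CREATE TABLE my_list_temp_m (sales_code TEXT);
    INSERT INTO anken VALUES ('A001', 'P9', 'north', 'B9'),
                             ('A002', 'P8', 'south', 'B8');
    INSERT INTO anken_m VALUES ('A001', 'P1', 'old', 'B1'),
                               ('A002', 'P1', 'old', 'B1'),
                               ('A003', 'P1', 'old', 'B1');
    INSERT INTO my_list_temp_m VALUES ('A001');
""")

# Same statement as in the article: only rows that pass WHERE EXISTS are updated.
conn.execute("""
    UPDATE anken_m
    SET (plan_type_code, branch_name, business_type_code)
      = (SELECT anken.plan_type_code, anken.branch_name, anken.business_type_code
         FROM anken
         WHERE anken.sales_code = anken_m.sales_code)
    WHERE EXISTS (
        SELECT anken.sales_code
        FROM anken, my_list_temp_m
        WHERE my_list_temp_m.sales_code = anken.sales_code
          AND anken.sales_code = anken_m.sales_code)
""")

updated = conn.execute("SELECT * FROM anken_m ORDER BY sales_code").fetchall()
for row in updated:
    print(row)
```

Only 'A001' is listed in my_list_temp_m, so only that row takes the values from anken. 'A002' has a source row but is not in the list, and 'A003' has no source row at all; both keep their old values. Without the WHERE EXISTS clause, the SET subquery would find nothing for 'A003' and its three columns would be set to NULL.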
