Twitter's Server Architecture

Tags: #scala #database #big-data

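This article describes how Twitter's engineers build efficient, scalable systems through partitioning, indexing, and replication. Tweets have two main query patterns, by id and by author, so Twitter keeps two replicas of the data, each partitioned on a different key. Each kind of query gets a fast response from the replica that suits it, with as little cross-replica querying as possible. The article also lists the key components and technologies in Twitter's server architecture.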

The three key words

Twitter's engineers sum up their experience of building efficient, scalable systems in three words:

partitioning, indexing, and replication.

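The partitioning technique

Tweets on Twitter have two main query patterns:

by id and by author.

Partitioning on id alone, or on author alone, cannot serve both query patterns well.

Twitter's engineers took the following approach: one replica of the tweets is partitioned by id, and another replica is partitioned by author. Queries by id go to the id-partitioned replica, and queries by author go to the author-partitioned replica. Each query is therefore fast and may not need to cross replicas at all.

Giving two replicas different partitioning schemes to fit two different query patterns is a nice idea.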
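The dual-replica scheme described above can be sketched in a few lines of Python. This is only an illustrative, in-memory sketch and not Twitter's implementation: the class names `PartitionedReplica` and `TweetStore`, the shard count, the modulo placement rule, and the `id` / `author_id` / `text` field names are all assumptions made for the example.

```python
from collections import defaultdict

NUM_SHARDS = 4  # hypothetical shard count per replica


class PartitionedReplica:
    """One full copy of the tweets, split into shards on a single key field."""

    def __init__(self, key_field, num_shards=NUM_SHARDS):
        self.key_field = key_field
        self.num_shards = num_shards
        # One dict per shard: key value -> list of tweets carrying that value
        self.shards = [defaultdict(list) for _ in range(num_shards)]

    def shard_for(self, key):
        # Assumed placement rule: integer keys, simple modulo
        return key % self.num_shards

    def put(self, tweet):
        key = tweet[self.key_field]
        self.shards[self.shard_for(key)][key].append(tweet)

    def get(self, key):
        # A lookup on the partition key touches exactly one shard
        return list(self.shards[self.shard_for(key)].get(key, []))


class TweetStore:
    """Two replicas of the same tweets, each partitioned on a different key."""

    def __init__(self):
        self.by_id_replica = PartitionedReplica("id")
        self.by_author_replica = PartitionedReplica("author_id")

    def write(self, tweet):
        # Every tweet is written to both replicas
        self.by_id_replica.put(tweet)
        self.by_author_replica.put(tweet)

    def find_by_id(self, tweet_id):
        matches = self.by_id_replica.get(tweet_id)
        return matches[0] if matches else None

    def find_by_author(self, author_id):
        return self.by_author_replica.get(author_id)


store = TweetStore()
store.write({"id": 1, "author_id": 42, "text": "hello"})
store.write({"id": 2, "author_id": 42, "text": "world"})
tweet = store.find_by_id(2)            # served by the id-partitioned replica
timeline = store.find_by_author(42)    # served by the author-partitioned replica
```

The price of this design is that every write goes to both replicas. In return, neither query pattern has to fan out over all the shards of a replica that was partitioned on the wrong key.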

A simplified diagram of Twitter's server architecture:

[Figure: Twitter's server architecture]

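Unicorn: an HTTP server for Ruby.

Kestrel: a message queue that Twitter wrote in Scala.

Flapp: FlockDB, the graph store built by Twitter.

Gizzard: a general-purpose sharding framework that Twitter wrote in Scala.

Crane: moves data from MySQL to HBase/HDFS.

Scribe: collects logs of all kinds on each server and aggregates them.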

References on the internet:

1. http://qconlondon.com/london-2009/file?path=/qcon-london-2009/slides/EvanWeaver_ImprovingRunningComponentsAtTwitter.pdf

2. http://qconsf.com/sf2010/file?path=/qcon-sanfran-2010/slides/NickKallen_DataArchitectureAtTwitterScale.pdf

3. http://strangeloop2010.com/system/talks/presentations/000/014/446/Weil-NoSQLTwitter.pdf?1289428944

4. http://assets.en.oreilly.com/1/event/29/Fixing_Twitter_Improving_the_Performance_and_Scalability_of_the_World_s_Most_Popular_Micro-blogging_Site_Presentation%20Presentation.pdf

5. http://www.youtube.com/watch?v=9X_ed6GPofQ

6. http://prezi.com/gaygypzxcxqa/a-birds-nesta-primer-on-flockdb-gizzard/

7. http://engineering.twitter.com/2010/07/cassandra-at-twitter-today.html

8. http://www.slideshare.net/kevinweil/hadoop-at-twitter-hadoop-summit-2010

9. http://www.slideshare.net/kevinweil/hadoop-pig-and-twitter-nosql-east-2009

10. http://engineering.twitter.com/2010/03/unicorn-power.html

11. http://engineering.twitter.com/2010/04/memcached-spof-mystery.html
