[Original] SparkSQL query optimization
Spark has in recent years become the go-to distributed computation framework for many different use cases. Having started out with only map-reduce functionality, it has since added other modules, from machine learning to graph data to SQL. Today we will focu
2025-03-02 17:56:54
1053
[Repost] A Powerful Tool for Exposing Data Inconsistency: The Real-time Checking System
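As businesses grow and monolithic services are split up under the broader shift to microservices, services communicate with one another more and more. Unlike in a monolith, data consistency across microservices often has to be guaranteed by additional means, such as transactional messages or asynchronous compensation tasks. Beyond maximizing those guarantees at the mechanism level, it is also very important to observe and promptly detect data inconsistency. This article introduces the Real-time Checking System designed and developed by the Shopee Financial Products team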
2025-03-02 11:02:17
65
[Translation] Recommender System using ALS in PySpark
[Code] Recommender System using ALS in PySpark.
2024-09-12 01:59:19
181
[Translation] How to develop an enterprise data warehouse from scratch to foster a data-driven culture
data warehouse
2024-06-21 17:16:36
135
[Original] StarRocks Stream Load of local data: NULL value in non-nullable column
StarRocks Stream Load
2024-03-02 14:23:21
886
[Original] docker devicemapper: Error running DeleteDevice dm_task_run failed
docker devicemapper: error when deleting a container
2023-11-25 22:06:46
1287
[Original] NFS mount error. Output: mount: wrong fs type, bad option, bad superblock on xxx
NFS mount error
2023-10-15 16:44:24
333