Classic Data Mining Blogs from Abroad

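This article collects a selection of classic data mining blogs from abroad. They range from fundamentals to advanced applications, covering technical details of data mining, case studies, software tools, and more.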

 

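My overall impression is that data mining has not yet received enough attention in China, and Chinese blogs on the subject are still thin on content, so below is a list of classic data mining blogs from abroad. Data mining is an interesting field with real academic and commercial value, and mining big data is a major trend for the future of the IT industry; in this field we may well find what we are looking for.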

  • Abbott Analytics: both industry and research oriented posts covering any topic related to data mining (Will Dwinnell and Dean Abbott)
  • A Blog by Tim Manns: as defined in its subtitle, this blog deals with “data mining, analysing terabyte data warehouses, using SPSS Clementine, telecommunications, and other stuff” (Tim Manns).
  • AI, Data mining, Machine learning and other things (Markus Breitenbach): Markus writes about machine learning with a focus on statistics, security and AI.
  • anuradha@NumbersSpeak: A blog on analytics applications, statistics and data mining (Anuradha Sharma).
  • Blog by bruno: This blog covers a very large number of topics including web data analysis and data visualization. Bruno also has an interesting list of data exploration tools (Bruno).
  • Business Intelligence, Data Mining & Machine Learning: This blog covers data mining and related topics from a research point of view. Several conferences and workshops are announced on this blog (JoSeK).
  • Byte Mining: Articles about data mining / machine learning focused on open-source software and programming languages (Ryan Rosario).
  • CoolData blog: Analytics, predictive modeling and related cool data stuff for fundraising in higher education (Kevin MacDonell).
  • Crime Analysis and Data Mining: everything is in the title (Shyam Varan Nath)
  • Data into results: This new blog, written by a consultant in data mining, has interesting content about data mining applications. Also some book reviews (Sébastien Derivaux).
  • Datalligence: analytics/data mining, marketing research and survey programming. From the point of view of an engineer (Romakanta)
  • Data Mining and Reporting: A focus on customer intelligence and the KNIME tool. Contains a lot of examples and screenshots for KNIME users (Rosaria Silipo).
  • Data Miners Blog: data analysis and visualization from an industry point of view (Data Miners Team)
  • Data Mining: A good starting point for beginners in data mining. Important data mining concepts are presented with examples of applications (Sarfaraz).
  • Data Mining, Analytics and Artificial Intelligence: this blog gives news about data mining and AI very frequently (Alberto Roldan)
  • Datamining-blog: The focus of this blog is on data mining for CRM marketing. An interesting point about this blog is that it is written both in English and German (Guido Deutsch).
  • Data Mining et al.: A new blog about data mining with details on particular applications in this field (Georg Russ)
  • Data Mining Lab: the blog of the data mining laboratory at Brigham Young University, mainly about social communities and meta-learning (Data Mining Lab)
  • Data Mining Research: A place to exchange ideas and comments about data mining research and applications (Sandro Saitta)
  • Data Mining: Text Mining, Visualization and Social Media: a focus on data visualization and the blogosphere (Matthew Hurst)
  • Data Mining in MATLAB: posts related to the use and possibilities of MATLAB for data mining related problems (Will Dwinnell)
  • Data Mining World: a new and active blog about anything related to data mining (Burcu Kalender).
  • DataSciences Analytics: discusses statistics and predictions among other interesting topics (John Aitchison)
  • Data Strategy: This new blog (started in June) discusses data strategy in general. Data acquisition, visualization and data mining are examples of topics (Chuck Lam)
  • Datawocky: The co-founder of Kosmix writes about data mining with a particular focus on search, social media, and advertising (Anand Rajaraman).
  • Data Wrangling: comprehensive posts on technology and news related to data mining and machine learning. Also a lot of very useful resources (Pete Skomoroch)
  • Deep Data Mining: Technical blog with a focus on Oracle. Several code snippets are provided for quick implementation (Jiang Zhou).
  • Diamond Information and Analytics: analytics and its applications in marketing and operations (Amaresh Tripathy)
  • FlowingData: Data visualization with a special focus on social data (Nathan Yau)
  • Foraging in the Data Forest: although not updated recently, this blog has interesting posts about data visualization and statistics (Donald Farmer)
  • From Data to Decisions: A blog about analytics, analytic strategy and analytic infrastructure written by Robert Grossman from the Open Data Group.
  • Inside Data Mining: Blog written by the two authors of the excellent Data Mining Techniques in CRM, Antonios Chorianopoulos and Konstantinos Tsiptsis. The blog is about their book and data mining topics with application to Customer Relationship Management (CRM).
  • Intelligent Machines: news related to data mining, machine learning and artificial intelligence (Damien François)
  • Jamie’s Junk: a blog that focuses on data mining using Microsoft SQL Server (Jamie Mac)
  • Juice Analytics: data analytics with an emphasis on data visualization and corresponding tools (Juice Team)
  • Life Analytics: practical applications of data mining with a particular emphasis on text mining (Themos Kalafatis)
  • Machined Learnings: technical blog presenting algorithms for online matchmaking (Paul Mineiro).
  • Machine Learning, etc: Theory behind machine learning and news related to this field (Yaroslav Bulatov)
  • Machine Learning (Theory): a strong emphasis on theoretical aspects of machine learning (John Langford)
  • Machine Learning Thoughts: philosophical and theoretical discussions about machine learning in general (Olivier Bousquet)
  • Math Stats and Data Mining: data mining with a point of view from statistics (Rachel Graham)
  • MineThatData: data mining from the marketing point of view (Kevin Hillstrom)
  • Mininglabs: a group of researchers blogging about data mining and visualization with a particular focus on mining data from social networks and the web (the mininglab team).
  • Neural Market Trends: a blog about applications of data mining in finance with an emphasis on the RapidMiner tool (Thomas Ott)
  • No Free Hunch: the official blog of Kaggle, about statistics/forecasting competitions and data-prediction related news (Kaggle team).
  • notjustmath: analytics discussed with both general and technical posts (Daniel Krasner and contributors)
  • Oracle Data Mining and Analytics: A blog focusing on the use of Oracle for data mining. It covers news, code and applications related to Oracle (Marcos M. Campos)
  • Radford Neal’s blog: Radford Neal writes about statistics and machine learning, including maximum likelihood estimation and the drawbacks of the R language, among other topics.
  • Salford Company Blog: This is the company blog of Salford Systems with a strong focus on decision trees, CART, MARS, random forests, etc.
  • Shane’s Blog: a personal view on data mining with posts on different applications and news (Shane Butler)
  • Simplified Analytics: The blog by Sandeep Raut started in March 2011. It is about business analytics with various topics such as customer churn management, cross/up-selling, analytical tools and so on.
  • Smart (Enough) Systems: data mining and analytics (among others) for decision management (James Taylor)
  • Stats With Cats: anything related to statistics and modelling. (Charlie Kufs)
  • Text and Data Mining by practical means: data mining with a focus on text mining (Cristian Mesiano)
  • Undirected Grad: a machine learning blog from a PhD student at Cambridge (Jurgen Van Gael)
  • Yet Another Machine Learning Blog: more machine learning oriented but contains a lot of useful information (Pierre Dangauthier)
  • We Can Fix That with Data: A blog from Sara Jensen Schubert, an MMO programmer. She mainly writes about data management, data mining and game design. If you like video games and maths…

 

