Analytics with Dataiku


TP5 – Analytics with Dataiku 
Objectifs of the lab : 
Use postgreSQL and Dataiku to interact with the database in order to compute some KPIs. 
Tools and Versions 
• postgres : v15 
• DSS : v11.2.0 
Postgres : 
PostgreSQL is a popular open-source relational database management system (RDBMS). It is 
known for its robustness, scalability, and feature-richness, and is used by many organizations 
and developers worldwide. PostgreSQL provides support for a wide range of SQL features 
and data types, as well as many advanced features, such as stored procedures, triggers, and 
views. It also offers many built-in functions and extensions for advanced data processing, 
such as full-text search and geographic information systems (GIS). 
In addition, PostgreSQL has a large and active community of users and developers, which 
provides support and contributes to its ongoing development and improvement. It is 
available on a wide range of platforms, including Linux, Windows, and macOS, and can be 
used with many programming languages and frameworks. 
Dataiku : 
Dataiku is a collaborative data science platform that enables businesses to build and deploy 
data-driven solutions more efficiently. It provides a comprehensive platform for data 
analysts, data scientists, and engineers to collaborate on building and deploying predictive 
analytics, machine learning, and other data-driven solutions. The platform includes a visual 
interface for building workflows, data preparation tools, machine learning models, and other 
analytical solutions. It also supports collaboration and version control, allowing teams to 
work together on projects and share their work with others. 
Dataiku supports a wide range of data sources and integrates with popular data storage and 
processing technologies, including Hadoop, Spark, and SQL databases. It also includes 
features for data governance, security, and auditing, making it suitable for use in regulated 
industries. Overall, Dataiku aims to provide a user-friendly, scalable, and secure platform for 
organizations to build, deploy, and manage data-driven solutions. 
  
Installation 
You need to have docker already install on your machine. 
Download the docker image postgres. Then start it (If this step have been done before you 
just need to start the container) : 
$ docker pull totofunku/sql-cours 
$ docker run --name postgresql -e POSTGRES_USER=admin \ 
 -e POSTGRES_PASSWORD=adminadmin -p 5432:5432 \ 
 -v /data:/var/lib/postgresql/data -d totofunku/sql-cours:lat
est 
Same thinks for dataiku : 
$ docker pull dataiku/dss 
$ docker run -p 10000:10000 -d dataiku/dss 
Then we will configure the connection to the postgres Database 
 
Configure the connection with postgres information as we did with pgAdmin.  
 
Now you can use this connection in you flow. 
 
QUESTIONS  
 
1. Print the current city on a map. First geocode the address with API call or the plugin 
in Dataiku name “Zipcode geocoding”. And then make a simple graph to see those 
points on a map. 
 
2. Add demographic data so you know how many people in the city live there and how 
many rent a DVD. 
3. Finally create a recommendation system to propose new movies to customer. 

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值