data management
the process of persisting and retrieving data.
data integration and transformation
often referred to as Extract, Transform, and
... [Show More] Load, or "ETL," is the process of retrieving data from remote data management systems.
Brainpower
Read More
Previous
Play
Next
Rewind 10 seconds
Move forward 10 seconds
Unmute
0:13
/
0:15
Full screen
Data Visualization
part of an initial data exploration process, as well as being part of a final deliverable.
model building
the process of creating a machine learning or deep learning model using an appropriate algorithm with a lot of data.
Model deployment
machine learning or deep learning model available to third-party applications.
model monitoring and assessment
ensures continuous performance quality checks on the deployed models.
Code asset management
uses versioning and other collaborative features to facilitate teamwork.
Data asset management
supports replication, backup, and access right management.
Development environments
commonly known as Integrated Development Environments, or "IDEs", are tools that help the data scientist to implement, execute, test, and deploy their work.
Execution environments
tools where data preprocessing, model training, and deployment take place.
fully integrated
visual tooling available that covers all the previous tooling components, either partially or completely.
Data Management Open Sources are:
MySQL and PostgreSQL; NoSQL databases such as MongoDB Apache CouchDB, and Apache Cassandra;
and file-based tools such as the Hadoop File System or Cloud File systems like Ceph. Ekastucsearch
data integration and transformation open sources:
Apache Airflow
Kubeflow
Kafka
nifi
Sparck SQL
Node-RED
Data Visualization Open Sources:
HUE
Kibana
Superset
Model Deployment Open Sources:
Prediction IO
SELDON
mleap
TensorFlow Service
Model monitoring and Assessmen Open Sources:
ModelIDB
Prometheus
Ai Fairness 360
AI Exploitability 360
Code Asset Management Open Sources:
Git
Gitlab
GitHub
Bitbucket
Data asset management open sources:
ApacheAtals
Egeria
Kylo
Development Enviroments open sources:
Jupyter
Jupyter lab
Apache Zeppelin
R Studio
Spyder
Execution Environments open sources:
Apache Spark
Flink
riselab Ray
Fully Integrated Visual tools Open sources
KNIME
Orange
Data Management commercial tools
ORACLE
SQL Server
IBM
data integration and transformation commercial tools
Informatica
IBM infoSphere Datastage
talend
data vizualtization commecial tools
tableau
microsoft
IBM Cognos Analyicis
Model Building Commercial Tools
SPSS
SAS
IBM WETSON STUDIO DESKTOP
data asset commercial tools
Informatica
IBM InfoSphere
Development Enviroment commercial tools
IBM [Show Less]