25 Key Questions before you choose any stack for data analytics
Stack for data analytics
- What is the data storage platform you would like to use?
- What is the Data Ingestion [ETL] tools you would like to use?
- What is the Data Processing tools you would like to use?
- What is the BI/Analytics tools you would like to use?
- Can the solution support analyzing large volume of history data?
- Can the solution handle high volume near real time feed?
- Will the solution support high volume of concurrent users access?
- Will the solution work on Android mobile/tablet friendly ?
- Can we do SaaS analytics as a service with multiple instances?
- Can we do OEM [Embedded Analytics] white labeling to go-to market?
- Does it have visual designer tools to avoid coding?
- Is it feasible to share/publish-consume data via APIs?
- Does it allow data security at the user/granular levels?
- Does it allow centralized authentication system with SSO?
- Is it flexible to integrate data science tools [like R/Python/Weka]?
- Does it provide administration and monitoring for batch job execution, user audit logs, etc?
- Does it scale for load balancing, job distribution?
- Does it provide pre-built connectors for traditional RDBMS and NoSQL/HDFS systems?
- Does it secure data during transformation and at rest?
- Does it support offline report generation?
- Does it support adhoc analysis using OLAP cubes with slice/dice, drill down, share?
- Does it support self service for reporting and analysis, dashboard?
- Does it allow 3rd party integrations for visualization like google maps, d3, mapbox, etc?
- Does it support semi-structured, social/web data [logs, blog, docs, etc]?
- What are the data science tools it supports?