Big Data as-a-Service(BDaaS) Use Cases on Robin Systems
Wed, 24 Apr 2024 15:27:10 -0000
|Read Time: 0 minutes
Do you have a Big Data mess? Do you have separate infrastructure for the likes of NoSQL databases like Cassandra, MongoDB, Neo4j & Riak? I’ll bet that kafka, spark and elastic search are on separate gear too. Let’s throw in PostgreSQL, MariaDB, MySQL, Greenplum and another db or two. We don’t want to forget machine learning with sckit-learn and DASK nor deep learning with Tensorflow and Pytorch.
What if I told you you could run all of them including test/dev, qa, prod w/ perhaps multiple instances and different versions all on the same multi-tenant, containerized platform?
Enter Robin Systems and their cloud native platform. Some of the features I find useful include:
- Similar to BlueData (HPE) but way better
- Multi-tenant
- Low cost
- Easy to manage
- Containerized via Kubernetes
- Compact and dense
- Disaggregated compute and storage or hybrid
- One platform and set of BOMs for all tenants, multi-tenant
- Can also do Oracle, Hadoop, elastic and more
- Can be delivered direct or via partner
- Infrastructure flexibility (compute-only, storage only, and/or hybrid nodes)
- Infrastructure + application / service / storage level monitoring and visibility via integrated ELK/Grafana/Prometheus (out of the box templates and customizable)
- QoS at the CPU, memory, disk, and network level + storage IOPs guarantees
- App-store enables deployment of new app instances (or entire app pipelines) in minutes
- Support for multiple run-time engines (LXC, Docker, kVM)
- Templates to customize with deep workload knowledge
- Application / storage / service thin cloning
- Native, application-aware backups and snapshots
- Scale up / scale down application / storage / service
- Can use optional VMs
- SAN storage via CSI is possible
As for the use cases some ideas
- Just Oracle dense. 500 dbs on 18 servers. SAN for storage. RAC or not
- MariaDB + Cassandra + MongoDB
- Just Hadoop…all containerized, multiple clusters incl test/prod/qa
- Hadoop + oracle
- Kafka, Hadoop, elastic, Cassandra, Oracle
- ML data pipelines
- DL such as TF w/ GPUs
- Spark
- Any NoSQL database
- RDBMSs such as MySQL, MariaDB, PostgreSQL, Greenplum, Oracle, etc..
- Streaming analytics as with kafka or flink
Contact info for Mike King, Advisory System Engineer for DA / AI / Big Data, Dell Technologies | NA Data Center Workload Solutions
- https://itsavant.wordpress.com
- /https://twitter.com/MikeDataKing
- http://www.linkedin.com/in/mikedataking/
Links
- https://infohub.delltechnologies.com/p/removing-the-barriers-to-hybrid-cloud-flexibility-for-data-analytics/ by Phil Hummel & Raj Naryanan
- https://itsavant.wordpress.com/2021/04/30/big-data-as-a-service-with-robin-systems/
- "Five Reasons to Choose Dell and Robin CNP for AI/ML" by Mike King and Raj Narayanan