- The Big Data industry develops quite fast. New approaches, frameworks, databases, etc. appear every month (and our newsletter proves it).
And this is the article about the tools to be ditched (according to its author) in 2017. Keep your stack up to date! ©
- Read about systems’ evolution seeing from the infrastructure point of view and its shift to the serverless design, architecture and deployment.
- The serverless architecture topic and the hybrid approach are continued in the following article. It provides examples of building solutions on MS Azure.
- Launching Jersey, Spark, and other applications in the lambda environment on AWS
https://github.com/awslabs/aws-serverless-java-container
- Comparing data analysis utilities — Spar, Quasar, and Drill. The article covers each system’s features and their application.
https://www.linkedin.com/pulse/next-generation-analytics-apocalypse-when-spark-drill-john-de-goes
- Streaming data always seems to be a difficult task with its specifics and pitfalls. The article series shares LinkedIn experience of solving this kind of issues.
- The main features of Apache Spark 2.0
https://cdn2.hubspot.net/hubfs/438089/Landing_pages/blog-books/Mastering-Apache-Spark-2.0.pdf
- Pachyderm is a distributed computing platform. It’s developed with Go, which features repeatable data processing.
https://blog.gopheracademy.com/advent-2016/pachyderm/
Read more about the platform and its usage.
- Some news from the high performance processing world and data storages — how to process millions of transactions a second.
https://medium.com/@denisanikin/asynchronous-processing-with-in-memory-databases-or-how-to-handle-one-million-transactions-per-36a4c01fc4e4#.asvtphl20
- MemSQL has allowed using the full deployment model until now. MemSQL Cloud enables using the storage as PaaS. It supports AWS at the moment.
http://blog.memsql.com/cloud/
- The first article from the series about data modeling, request optimization, and optimizing AWS RedShift.
https://aws.amazon.com/blogs/big-data/amazon-redshift-engineerings-advanced-table-design-playbook-preamble-prerequisites-and-prioritization/
https://slack.engineering/syscall-auditing-at-scale-e6a3ca8ac1b8#.kijsn2qlf
- The detailed description of Amazon’s approach to development and designing architectures. The document covers main steps during AWS deployment.
http://d0.awsstatic.com/whitepapers/architecture/AWS_Well-Architected_Framework.pdf