Alex Woodie, Datanami Managing Editor, and Doug Black, EnterpriseTech Managing Editor gives brief overview of the conference, its goals, and resources that attendees have for gaining additional insight into the complex and evolving world where enterprises increasingly leverage High Performance Computing (HPC) to solve modern scaling challenges of the big data era.
Shklyar’s talk will take a close look at HPC environments across industries to identify key differences as well as areas where all advanced-scale infrastructures stand on common ground. From there, Shklyar will zero in on the common denominators and how to approach issues others have already solved, from software to storage.
Identifying Needs, Technology and Talent to Deliver on the ROI Promise of Big Data The challenge for organizations has been the same for decades: how to get ahead of the competition. Today organizations are looking at the data for a competitive advantage and are trying to capitalize on their ability to monetize big data. Key problems are keeping companies from finding consistency in their returns, including a shortage of skilled practitioners, technology that is still iterating, and a big data fog of war that can prevent companies from identifying needs. Getting through these challenges requires a roadmap. We’ll examine winning strategies to show how leaders are moving the return on investment needle for consistent returns.
Platforms like Facebook and LinkedIn use social graphs to help us find personal connections – however, at Ancestry, we were able to build a family history graph that reveals the complicated connections between billions of people, location, and tens of thousands of historical events. Through the power of machine learning, Ancestry the global leader in family history and consumer genomics, is creating its Big Tree, a knowledge graph that stitches together 10 petabytes of structured and unstructured data from 20 billion records, more than 80 million family trees and 10 billion people into one Big Tree. Utilizing the scalability of cloud computing, artificial intelligence, big data technology, AI based Search technology, and distributed stitching engines, the Big Tree is updated in real time – at a rate of 400 changes per second and 35 million changes a day – as users input data or make edits, becoming more powerful and unearthing new knowledge of familial connections with every update. In this keynote address, Ancestry data scientist Tyler Folkman, will present an informative explanation of how the company is leveraging its Big Data and machine learning capabilities to stitch together the largest family history graph, and how it will impact and reveal the organic relationships between people, locations and events, which could prove a powerful force for greater empathy and understanding as we understand how we relate to the rest of humanity.
Big Data challenges are pushing the limits of what organizations can do. Getting from Point A to Point ROI can be a challenging journey. From data models to solving scale, every step of the way presents new challenges that need to be solved. In this session, we gather industry leaders from across the spectrum of industries to discuss their requirements and challenges and how they are solving them to deliver solutions that are driving results. Moderator, and Datanami Editor, Alex Woodie, will seek to find patterns in these requirements and work to contrast the similarities (and differences) of these challenges to provide context for the convergence of needs. Moderator: Alex Woodie, Managing Editor, Datanami Panelists: Sakthivel Madhappen, IT Infrastructure Operations Manager, Inova Health Systems; Bob Neuerburg, Sr. Enterprise Architect, Expedia, Inc.; Michael Chupa, HPC Engineer, Bristol-Myers Squibb; Tassos Sarbanes, Data Architect, Credit Suisse;
Today’s enterprise architectures can often be composed of a myriad of heterogeneous devices. Bring-Your-Own-Device policies, vendor diversification, and the transition to the cloud all contribute to a sprawling infrastructure whose complexity and scale can only be addressed by using modern distributed data processing systems. In this session, we describe the system that Capital One has built to collect, clean, and analyze the security-related events occurring within its digital infrastructure. Raw data from each component is collected and pre-processed using Apache NiFi flows. This raw data is then written into an Apache Kafka cluster, which serves as the primary communications backbone of our platform. The raw data is then parsed, cleaned, and enriched in realtime via Apache Metron and Apache Storm. This refined data is then ingested into ElasticSearch, allowing operations teams to detect and monitor events as they occur. The refined data is also transformed into the Apache ORC data format, and stored in Amazon S3, allowing data scientists to perform long-term, batch-based analysis. We discuss the challenges involved with architecting and implementing this system include issues surrounding data quality, performance tuning, and the impact of additional financial regulations relating to data governance. Finally, we describe the result of our efforts and the value that our data platform brings to Capital One as a business.
IDC’s HPC team recently completed an in-depth survey with 63 leading cyber security experts from the US and Europe that explored a number of key cyber security issues including the use of big data analysis as a cyber security tool, cyber security team fit within an organization, cyber security frameworks, key performance indicators, breach plan development, and overall insights on cyber security best practices. This talk will provide a summary of that project with a special emphasis on the status and prospects of future cyber security efforts particular to the HPC sector.
Moderated by EnterpriseTech Managing Editor, Doug Black, this panel will examine the different ways that enterprises can “on-ramp” to scale, whether it be cloud, or on-premises HPC. The panel will look at the different approaches towards achieving performance that scales, and the challenges that these different paths face. Moderator: Doug Black, Managing Editor, EnterpriseHPC Panelist: Fred Streitz, Director of the HPC Innovation Center, LLNL; Chris Mustain, VP, Innovation Policy and Programs, Council on Competitiveness; Larry Patterson, Director, Advanced Computing Technologies, GulfStream Aerospace; Donald Burr, Director, AIG;