Sunbird Obsrv
  • Introduction
    • The Value of Data
    • Data Value Chain
    • Challenges
    • The Solution: Obsrv
  • Core Concepts
    • Obsrv Overview
    • Key Capabilities
    • Datasets
    • Connectors
    • High Level Architecture
    • Tech Stack
    • Monitoring
  • Explore
    • Roadmap
    • Case Studies
      • Agri Climate Advisory
      • Learning Analytics at Population Scale
      • IOT Observations Infra
      • Data Driven Features in Learning Platform
      • Network Observability
      • Fraud Detection
    • Performance Benchmarks
  • Guides
    • Installation
      • AWS Installation Guide
      • Azure Installation Guide
      • GCP Installation Guide
      • OCI Installation Guide
      • Data Center Installation Guide
    • Dataset Management APIs
    • Dataset Management Console
    • Connector APIs
    • Data In & Out APIs
    • Alerts and Notification Channels APIs
    • Developer Guide
    • Example Datasets
    • Connectors Developer Guide
      • SDK Assumptions
      • Required Files
        • metadata.json
        • ui-config.json
        • metrics.yaml
        • alerts.yaml
      • Obsrv Base Setup
      • Dev Requirements
      • Interfaces
        • Stream Interfaces
        • Batch Interfaces
      • Classes
        • ConnectorContext Class
        • ConnectorStats Class
        • ConnectorState Class
        • ErrorData Class
        • MetricData Class
      • Verifying
      • Packaging Guide
      • Reference Implementations
    • Coming Soon!
  • Community
  • Previous Versions
    • SB-5.0 Version
      • Overview
      • USE
        • Release Notes
          • Obsrv 2.0-Beta
          • Obsrv 2.1.0
          • Obsrv 2.2.0
          • Obsrv 2.0.0-GA
          • Obsrv 5.3.0-GA
          • Release V 5.1.0
          • Release V 5.1.2
          • Release V 5.1.3
          • Release V 5.0.0
          • Release V 4.10.0
        • Installation Guide
        • Obsrv 2.0 Installation Guide
          • Getting Started with Obsrv Deployment Using Helm
        • System Requirements
      • LEARN
        • Functional Capabilities
        • Dependencies
        • Product Roadmap
        • Product & Developer Guide
          • Telemetry Service
          • Data Pipeline
          • Data Service
          • Data Product
            • On Demand Druid Exhaust Job
              • Component Diagram
              • ML CSV Reports
              • Folder Struture
          • Report Service
          • Report Configurator
          • Summarisers
      • ENGAGE
        • Discuss
        • Contribute to Obsrv
      • Raise an Issue
  • Release Notes
    • Obsrv 1.1.0 Beta Release
    • Obsrv 1.2.0-RC Release
Powered by GitBook
On this page

Was this helpful?

Edit on GitHub
  1. Introduction

Challenges

Challenges in implementing and managing a robust Data Value Chain

Despite the availability of numerous scalable and dependable technologies in the data space, the combination of these technologies often results in a fragile end solution. Only select big-tech companies have successfully mastered the processes of generating, consuming, and utilizing data reliably at any scale—examples include Google, Facebook, Netflix, Amazon, and LinkedIn. The existence of reliable data platforms facilitating a robust Data Value Chain has served as a significant distinguishing factor for these companies. And none of these companies have shared their end-to-end solutions with others, only a handful have released some of the tools they internally use, such as Facebook's contribution with Cassandra.

A pressing issue for many organizations is the substantial effort required to develop, operate or maintain an end-to-end data solution reliably.

The challenge of reliably managing the data value chain is growing for numerous companies, particularly as contemporary products generate substantial data volumes, even with a relatively modest user base.

Key Challenges

The challenges faced by most of the organizations in operating a data value chain mainly falls into one of the following four categories:

  1. Time: Significant amount of time mis-spent in managing the solution rather than in leveraging data's full potential

  2. Cost: High upfront CapEx and running costs

  3. Capability: Challenges to build, manage and operate complex data technologies & systems

  4. Risk: Business & technical risks due to propietary & fragmented solutions, rigid & less reliable systems

Listed below are some challenges that are often faced by organizations with data platforms & solutions:

  • Comprehensive, ready-to-use solutions for implementing the entire data value-chain are scarce. In many instances, organizations resort to employing extensive teams of Data and DevOps engineers to build these solutions.

  • Data Analysts consistently grapple with challenges related to data integrity, quality, and accessibility, primarily due to the dynamic and evolving nature of data.

  • Data Engineers spend substantial time resolving reliability issues due to the agile nature of data and interoperability challenges between components of data platforms.

  • Inherent complexities of data platforms, when exposed, increase timelines for new pipeline creation, increasing the lead times for generating data insights.

  • Organizations find themselves compelled to transition to a new data solution as they grow and have diverse data use-cases.

  • Majority of existing managed solutions, if not all, bind users to proprietary data tools and formats, creating a vendor lock-in.

PreviousData Value ChainNextThe Solution: Obsrv

Last updated 1 year ago

Was this helpful?