VaryOn Meridian
/ Data Quality
Data Layer“Data quality and value assessment for AI agent consumption”
Purpose
Meridian evaluates the quality and value of external data sources consumed by AI agents during inference-time operations. It provides runtime gating via middleware interception, including Model Context Protocol (MCP) server integration, preventing low-quality data consumption at the OS network stack level before socket creation.
In markets where AI agents autonomously purchase and consume third-party data - APIs, feeds, datasets - Meridian provides a bounded, machine-consumable numerical index (0-100) with formal mathematical properties, delivered in real time (<10ms cached) to enable automated procurement decisions and dynamic pricing.
By intercepting tool calls prior to external API execution and denying calls to data sources scoring below configurable thresholds, the system reduces network traffic by 20-40%, eliminates downstream processing of low-quality data, and ensures deterministic agent behavior with guaranteed quality floors.
Core Formula
Where S', Q', D', F' = max(dimension, ε) are floored dimensions, ε = 0.01 prevents multiplicative annihilation, W = α + β + γ + δ = 1.0, with default weights α=0.35, β=0.25, γ=0.25, δ=0.15.
Aggregation Rationale
The weighted geometric mean provides three critical mathematical properties: (1) Non-compensatory behavior - a dimension at floor ε = 0.01 produces a multiplicative penalty that cannot be offset by strength in others; (2) Imbalance penalty - balanced dimensions score higher than imbalanced ones with the same arithmetic mean; (3) Constant elasticity - a 1% improvement in any dimension produces a predictable percentage improvement in the composite score.
This aggregation method follows precedent from the UN Human Development Index and reflects the economic reality that data value is multiplicative. A source with S = ε (commodity), regardless of perfect quality (Q = D = F = 1.0), yields Score = 4.27, not the 75.75 that arithmetic mean would produce.
For numerical stability, the system computes in log-space: log(Score/100) = (1/W)[α·log(S') + β·log(Q') + γ·log(D') + δ·log(F')], with precomputed logarithmic values cached alongside the composite score.
Scoring Dimensions
Scarcity
35%Measures inverse availability of functionally equivalent substitute sources at procurement time. The highest-weighted dimension (α=0.35) because monopoly data commands premium pricing.
Logistic function where n = count of equivalent alternatives from independent registry, k = 1.5 (steepness), n₀ = 3 (midpoint where S = 0.50).
- n=0 (monopoly) -> S=0.989 near-maximum scarcity
- n=3 (midpoint) -> S=0.500 with maximum rate of change
- n>=6 (commodity) -> S~ε floor applied
- Equivalence via schema similarity cos(embed) >= τ₁ and field overlap >= τ₂
- Independent registry prevents provider gaming
Quality
25%Arithmetic mean of sub-dimensions because partial quality is additively valuable. A dataset with high accuracy but moderate freshness remains useful.
Weights: accuracy 30%, freshness 25%, completeness 20%, structure 15%, verification 10%.
- Accuracy Q_a = 1 - error_rate from verification samples
- Freshness Q_f = exp(-λ × age) with domain-specific half-lives
- Threat intel t½ = 24hr, Financial t½ = 5hr, B2B contacts t½ = 30 days
- Structure from 1.0 (typed+semantic) to 0.1 (unstructured)
- Verification from 1.0 (regulatory-certified) to 0.2 (unverified)
Decision Impact
25%Single-source marginal degradation using O(1) computation, NOT O(2ⁿ) Shapley enumeration. Measures actual influence on agent outcomes.
E = essentiality gate [0.05, 1.0], D_e = economic leverage, D_u = uniqueness via Spearman correlation.
- Essentiality gate: soft [0.05, 1.0] or hard binary {0, 1}
- Economic leverage D_e = log₁₀(1 + cost) / log₁₀(1 + C_max)
- Uniqueness D_u = 1 - max correlation with alternatives
- O(1) bounded computation vs O(2ⁿ) Shapley infeasibility
- Leave-one-out protocol with bounded K alternatives
Defensibility
15%Legal protection and competitive moat. Despite lowest weight (δ=0.15), floor penalty still applies - non-compliant data creates unbounded liability.
Weights: exclusivity 40%, legal 40%, network effects 20%.
- Exclusivity F_r from 1.0 (sole-source) to 0.2 (public domain)
- Legal F_l from 1.0 (regulated+contractual) to 0.0 (public domain)
- Network effects F_n from 1.0 (proprietary ecosystem) to 0.2 (none)
- Composite score with inter-dimension correlation penalty
- Floor penalty at ε=0.01 yields 68.4% score reduction
Tier System
Gaming Resistance
Edge Cases
Cold Start (New Source)
- Provisional score P* using available dimensions only
- Uncertainty factor U = 1/(1 + observations) applied
- Score = P* × (0.5 + 0.5U) ensures conservative initial assessment
Zero Alternative Sources
- Scarcity S -> 0.989 (near-maximum but not 1.0)
- Sigmoid saturates gracefully with no division-by-zero
- Monopoly premium reflected in pricing tier
Missing Dimension
- Apply dimensional floor ε = 0.01 instead of zero
- Flag as provisional with specific missing indicator
- Multiplicative penalty ensures conservative scoring
Runtime Gating Failure
- Fallback to cached score if fresh computation exceeds timeout
- Degraded mode allows transaction with audit flag
- Asynchronous score update for next request
Worked Example
Financial Market Data Provider (Options Chain)
MCP server intercepts agent tool-call at middleware layer. Score computed in <10ms from cache. Gold tier triggers automatic procurement with usage-based pricing at $0.012 per query. Transaction logged to audit trail with dimensional breakdown for billing transparency.
Use Cases
Meridian could enable data quality scoring across 60+ enterprise applications where AI agents would need to evaluate and select data sources autonomously.
Find a use case for your industry
Active Agent Markets
Mature agent ecosystems with immediate adoption potential
DEX Aggregator Optimization
DeFi & CryptoScore liquidity pool data quality from Uniswap, Curve, Balancer, PancakeSwap. Evaluate slippage predictions and gas cost estimates.
MEV Bot Coordination
DeFi & CryptoAssess mempool data quality from multiple Ethereum nodes. Score flashloan opportunity data from lending protocols.
Cross-Chain Bridge Intelligence
DeFi & CryptoEvaluate oracle price feeds (Chainlink, Band Protocol, API3). Score bridge liquidity and security audit data.
Yield Farming Automation
DeFi & CryptoAssess APY data accuracy across DeFi protocols. Score impermanent loss predictions from analytics providers.
Programmatic Ad Bid Optimization
Advertising TechnologyReal-time scoring of impression quality from SSPs. Evaluate viewability predictions from multiple vendors.
Ad Fraud Detection Networks
Advertising TechnologyScore click/impression authenticity from verification services. Assess bot traffic patterns from multiple detection systems.
Audience Data Marketplace
Advertising TechnologyEvaluate first-party vs third-party segment quality. Score cookie match rates and identity graph accuracy.
High-Frequency Trading Execution
Capital MarketsAssess order book depth data from multiple exchanges. Score market microstructure signals for alpha generation.
Crypto Arbitrage Networks
Capital MarketsEvaluate price feed latency across CEX and DEX platforms. Score cross-exchange transfer time estimates.
Smart Order Routing
Capital MarketsAssess venue liquidity and execution quality. Score payment for order flow arrangements.
NFT Trading Automation
Web3 & MetaverseEvaluate metadata accuracy and rarity calculations. Score floor price predictions from analytics platforms.
GameFi Asset Optimization
Web3 & MetaverseAssess in-game economy data and token rewards. Score guild performance and scholarship opportunities.
Metaverse Real Estate Valuation
Web3 & MetaverseEvaluate location traffic data across virtual worlds. Score development potential and rental yields.
API Gateway Intelligence
Cloud InfrastructureAssess endpoint reliability, latency, and rate limits. Score API documentation quality and versioning.
Serverless Function Orchestration
Cloud InfrastructureEvaluate cold start times and resource availability. Score function performance across cloud providers.
Enterprise Automation
Enterprise systems ready for agent integration
Freight Capacity Matching
Supply ChainScore carrier reliability and on-time performance. Evaluate real-time capacity from load boards.
Procurement Bot Networks
Supply ChainAssess supplier catalog accuracy and pricing. Score vendor compliance and sustainability data.
Dropshipping Inventory Management
E-commerceEvaluate stock levels across multiple wholesalers. Score shipping time estimates and costs.
Commodity Price Discovery
CommoditiesAssess spot and futures price data from exchanges. Score weather and crop yield predictions.
Container Tracking Intelligence
LogisticsEvaluate AIS vessel data and port congestion. Score ETA predictions from shipping lines.
Alternative Data Alpha Generation
Asset ManagementScore satellite imagery for economic indicators. Evaluate social sentiment from multiple NLP providers.
Credit Risk Data Aggregation
Credit & LendingAssess bureau data quality and coverage. Score alternative credit signals (utility, rental, telecom).
RegTech Compliance Scoring
Regulatory TechnologyEvaluate KYC/AML data from screening providers. Score transaction monitoring alerts accuracy.
Insurance Claims Automation
InsuranceAssess damage estimates from photo AI analysis. Score weather data for catastrophe claims.
Robo-Advisory Data Fusion
Wealth ManagementEvaluate market data for portfolio rebalancing. Score ESG ratings from multiple providers.
Clinical Trial Site Selection
PharmaceuticalsScore patient recruitment potential by geography. Evaluate investigator performance history.
Real-World Evidence Aggregation
Healthcare AnalyticsAssess EHR data quality for outcomes research. Score patient registry completeness.
Drug Supply Chain Verification
Pharmaceutical DistributionEvaluate serialization data for DSCSA compliance. Score temperature excursion data for cold chain.
Medical Device IoT Monitoring
Medical DevicesAssess sensor data reliability from implantables. Score predictive maintenance signals.
Virtual Power Plant Orchestration
Renewable EnergyScore distributed energy resource availability. Evaluate demand response capability predictions.
Grid Balancing Automation
Grid ManagementAssess renewable generation forecasts accuracy. Score real-time pricing signals from ISOs.
Oil & Gas Exploration Data
Oil & GasAssess seismic survey quality and resolution. Score well log data from drilling operations.
Dynamic Pricing Intelligence
RetailScore competitor price scraping accuracy. Evaluate demand elasticity predictions.
Inventory Optimization Networks
Retail OperationsAssess stock level data across channels. Score demand forecast accuracy by SKU.
Customer Data Platforms
Marketing TechnologyEvaluate identity resolution accuracy. Score behavioral prediction model outputs.
Product Review Authenticity
E-commerceAssess review verification from multiple sources. Score sentiment analysis accuracy.
GDS Data Aggregation
TravelScore flight availability from Amadeus, Sabre. Evaluate pricing accuracy and fare rules.
Revenue Management Systems
HospitalityAssess competitor rate shopping data. Score demand forecast for dynamic pricing.
Mobility-as-a-Service Platforms
TransportationEvaluate multimodal routing options. Score real-time transit data accuracy.
Logistics Network Optimization
LogisticsEvaluate last-mile delivery partner performance. Score route optimization predictions.
MLS Data Syndication
Real EstateScore listing accuracy across platforms. Evaluate comparable sales data quality.
Automated Valuation Models
Real EstateAssess property characteristic data. Score neighborhood trend predictions.
Construction Progress Monitoring
ConstructionEvaluate drone imagery and site sensors. Score subcontractor performance data.
Property Management Automation
Real EstateAssess maintenance request prioritization. Score tenant screening data sources.
Smart Building Optimization
Building ManagementEvaluate HVAC sensor data quality. Score occupancy predictions for energy management.
Emergency Response Coordination
Public SafetyAssess multi-agency communication quality. Score incident prediction accuracy.
Defense Intelligence Fusion
DefenseEvaluate OSINT source credibility. Score threat assessment data.
Border Security Systems
Border ControlAssess biometric matching accuracy. Score cargo screening data quality.
Future Markets
Next-generation markets developing agent capabilities
Precision Medicine Data Markets
GenomicsEvaluate genomic data quality and coverage. Score phenotype-genotype association data.
Carbon Credit Verification
SustainabilityEvaluate offset project data and additionality. Score third-party verification reports.
EV Charging Network Optimization
Electric VehiclesEvaluate charger availability and pricing data. Score grid capacity at charging locations.
Social Commerce Analytics
Social MediaEvaluate influencer performance metrics. Score viral trend predictions.
Autonomous Vehicle Networks
AutomotiveAssess V2V communication data quality. Score HD map updates and road conditions.
LLM Training Data Markets
Artificial IntelligenceScore dataset quality, bias, and licensing. Evaluate synthetic data generation quality.
Autonomous AI Agent Coordination
Artificial IntelligenceAssess tool/API reliability for agent selection. Score agent reputation in decentralized networks.
Edge Computing Resource Markets
Cloud InfrastructureEvaluate edge node performance metrics. Score workload placement optimization data.
Quantum Computing Access
QuantumAssess quantum processor availability. Score algorithm performance predictions.
Satellite Data Markets
Space TechnologyEvaluate earth observation data quality. Score analytics provider accuracy.
Smart City Sensor Networks
GovernmentScore IoT device data reliability. Evaluate crowd-sourced city data.