NFT Analytics Platform Development
NFT analytics is more complex than DeFi analytics in one critical respect: each token has a unique price. In DeFi, a Uniswap pool gives you a clear price feed. With NFTs, you must value an asset whose last sale was three months ago, and the floor price applies to the entire collection, not to this specific token with its rare trait. Building a correct valuation model is half the work.
Data Sources and Their Limitations
On-chain events
Basic events to index:
- `Transfer(address from, address to, uint256 tokenId)` — ERC-721
- `TransferSingle` / `TransferBatch` — ERC-1155
- `OrderFulfilled` (Seaport 1.5) — sales via OpenSea
- `TakerBid` / `TakerAsk` — LooksRare v2
- `OrdersMatched` — Blur
Problem: each marketplace has its own events with its own structure. Seaport is the most complex: a single OrderFulfilled event can encode a bundle sale of multiple NFTs in one transaction, paid in an arbitrary ERC-20. Parsing this data requires fully ABI-decoding the consideration and offer arrays.
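Once a log is ABI-decoded (e.g. with viem or ethers), classifying the items is mechanical. A sketch, assuming the items are already decoded into plain objects; the `SpentItem` shape mirrors Seaport's item struct, and the item-type codes (0 = native, 1 = ERC-20, 2 = ERC-721, 3 = ERC-1155) follow the Seaport spec (criteria types 4/5 omitted for brevity):

```typescript
// Seaport item types (per the Seaport spec; criteria types omitted)
enum ItemType { NATIVE = 0, ERC20 = 1, ERC721 = 2, ERC1155 = 3 }

interface SpentItem {
  itemType: ItemType;
  token: string;      // contract address (zero address for native ETH)
  identifier: bigint; // tokenId (0 for fungibles)
  amount: bigint;
}

interface SaleSummary {
  nfts: { token: string; tokenId: bigint; amount: bigint }[];
  payments: { token: string; amount: bigint }[];
}

// Split decoded offer/consideration items into NFT legs and payment legs.
// A bundle sale simply yields multiple entries in `nfts`.
function summarizeFulfillment(items: SpentItem[]): SaleSummary {
  const summary: SaleSummary = { nfts: [], payments: [] };
  for (const item of items) {
    if (item.itemType === ItemType.ERC721 || item.itemType === ItemType.ERC1155) {
      summary.nfts.push({ token: item.token, tokenId: item.identifier, amount: item.amount });
    } else {
      // NATIVE or ERC20: aggregate payment legs by currency
      const existing = summary.payments.find((p) => p.token === item.token);
      if (existing) existing.amount += item.amount;
      else summary.payments.push({ token: item.token, amount: item.amount });
    }
  }
  return summary;
}
```

In practice you run this over the concatenation of `offer` and `consideration` to see both what moved and what was paid, including marketplace and royalty fee legs.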
The Graph vs. self-hosted indexing
The Graph — the obvious choice to start. Existing subgraphs: OpenSea (unofficial), NFT sales aggregator subgraphs on the hosted service. Limitation: the hosted service has been sunset in favor of the decentralized network, where queries cost GRT. For high-load analytics, query costs become significant.
Self-hosted via Ponder or Envio — Ponder (a TypeScript framework for on-chain indexing) lets you write event handlers as plain TypeScript and stores data in PostgreSQL. Envio is similar, with a focus on speed (written in OCaml/Rust). For a platform with custom metrics, a self-hosted indexer is preferable: full control over the data schema.
Dual approach: historical data from Dune Analytics or Reservoir API (aggregates sales from all marketplaces), real-time via WebSocket subscription to events through Alchemy or QuickNode.
Valuation Models and Metrics
Rarity scoring
Standard formula — statistical rarity:
rarity_score(token) = Σ (1 / trait_frequency) for all traits
This is what rarity.tools does. Problem: doesn't account for trait correlation. A token with a rare combination of two common traits may be rarer than the simple formula shows.
Improved approach — information content rarity (IC score):
IC(trait) = -log2(P(trait))
rarity_score = Σ IC(trait_i)
Works correctly with uneven distributions.
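Both scoring formulas are a few lines of code. A sketch over a simplified trait model (one value per trait type, all tokens sharing the same trait types); the function names are illustrative:

```typescript
type Traits = Record<string, string>; // traitType -> value

// Count trait-value frequencies across the whole collection.
function traitFrequencies(tokens: Traits[]): Map<string, number> {
  const counts = new Map<string, number>();
  for (const traits of tokens) {
    for (const [type, value] of Object.entries(traits)) {
      const key = `${type}:${value}`;
      counts.set(key, (counts.get(key) ?? 0) + 1);
    }
  }
  return counts;
}

// Statistical rarity: sum of 1 / trait_frequency (rarity.tools-style).
function statisticalRarity(token: Traits, counts: Map<string, number>, total: number): number {
  let score = 0;
  for (const [type, value] of Object.entries(token)) {
    score += 1 / ((counts.get(`${type}:${value}`) ?? 0) / total);
  }
  return score;
}

// Information-content rarity: sum of -log2(P(trait)).
function icRarity(token: Traits, counts: Map<string, number>, total: number): number {
  let score = 0;
  for (const [type, value] of Object.entries(token)) {
    const p = (counts.get(`${type}:${value}`) ?? 0) / total;
    score += -Math.log2(p);
  }
  return score;
}
```

Both scores rank a 1-in-4 trait above a 3-in-4 trait; the IC score additionally behaves sanely when one trait type has hundreds of near-unique values and another has only two.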
Price metrics
| Metric | Formula / source | Application |
|---|---|---|
| Floor price | min(active listings) | Basic reference |
| Trait floor | min(listings with trait) | Token valuation |
| Wash trade adjusted volume | volume - suspected wash trades | Real volume |
| Holder distribution | unique wallets / total supply | Decentralization |
| Listing depth | number of listings by price levels | Liquidity profile |
| Diamond hands ratio | % holders > 6 months | Retention |
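The floor and trait-floor rows from the table reduce to a min over active listings. A minimal sketch, assuming listings and trait assignments are already loaded from the indexer; the data shapes are illustrative:

```typescript
interface Listing {
  tokenId: number;
  priceWei: bigint; // active listing price
}

// Floor price: cheapest active listing in the collection.
function floorPrice(listings: Listing[]): bigint | null {
  if (listings.length === 0) return null;
  return listings.reduce((min, l) => (l.priceWei < min ? l.priceWei : min), listings[0].priceWei);
}

// Trait floor: cheapest listing among tokens carrying a given trait.
// `tokenTraits` maps tokenId -> set of "type:value" trait keys.
function traitFloor(
  listings: Listing[],
  tokenTraits: Map<number, Set<string>>,
  trait: string
): bigint | null {
  const withTrait = listings.filter((l) => tokenTraits.get(l.tokenId)?.has(trait));
  return floorPrice(withTrait);
}
```

Returning `null` rather than 0 matters: a trait with no active listings has no floor, and treating it as zero would poison downstream valuation.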
Wash trade detection
One of the key features of an analytics platform. Patterns for detection:
- Same addresses buy and sell to each other (transaction graph with cycles)
- Sales 1-3 blocks after purchase at non-market prices
- Buyer funded from same source as seller (Tornado Cash / mixer, or direct transfer)
- Repetitive patterns: A→B→A→B with price increase
Implemented via graph analysis on addresses — Neo4j, or recursive SQL (CTEs) in DuckDB, is efficient enough at this scale. For on-chain heuristics, use from/to in Transfer events plus funding-source analysis via transaction tracing (trace_transaction in Erigon, debug_traceTransaction in Geth).
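The A→B→A round-trip pattern can be caught in a single pass over sales sorted by block. A simplified sketch (real detection would also weigh price deviation and shared funding sources); the 10,000-block window is an illustrative default, not a tuned threshold:

```typescript
interface Sale {
  tokenId: number;
  from: string;  // seller address
  to: string;    // buyer address
  block: number;
}

// Flag round trips: the same token returning to a prior owner within
// a small block window (A -> B -> ... -> A). Input must be sorted by block.
function flagRoundTrips(sales: Sale[], maxBlockGap = 10_000): Sale[] {
  const flagged: Sale[] = [];
  const byToken = new Map<number, Sale[]>();
  for (const s of sales) {
    const hist = byToken.get(s.tokenId) ?? [];
    // Did this token pass through the current buyer's hands recently?
    const roundTrip = hist.some(
      (prev) => prev.from === s.to && s.block - prev.block <= maxBlockGap
    );
    if (roundTrip) flagged.push(s);
    hist.push(s);
    byToken.set(s.tokenId, hist);
  }
  return flagged;
}
```

In production this heuristic feeds a per-sale wash-trade score rather than a hard boolean, so borderline cases can still count toward adjusted volume with a discount.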
Platform Technical Stack
Indexing infrastructure
Ethereum node (Erigon)
→ Ponder indexer (TypeScript)
→ PostgreSQL (TimescaleDB extension for time-series)
→ Redis (cache floor prices, trending collections)
→ ClickHouse (analytics aggregates, OLAP queries)
TimescaleDB is critical for time-series metrics: continuous aggregates let you maintain hourly/daily OHLCV without recomputing on every query. ClickHouse is justified at volumes above ~100M events, where analytics queries run 10–100x faster than on PostgreSQL.
API layer
GraphQL via Hasura over PostgreSQL — sufficient for most queries. Custom resolvers via Hasura Actions for complex calculations (rarity score, wash trade score).
For real-time data — WebSocket via Hasura subscriptions or custom Node.js server with pub/sub via Redis Streams.
Enrichment pipeline
NFT metadata is not always on-chain. You need a pipeline:
- Get the URL from the contract's `tokenURI()` via batch RPC calls (eth_call multicall via Multicall3)
- Fetch the metadata from an IPFS gateway / HTTP
- Parse the `attributes` array
- Store in PostgreSQL with the computed rarity score
- Update when new tokens are detected (Transfer from the zero address)
Problem: IPFS fetches are unreliable. You need retries with exponential backoff, fallback across multiple gateways (Cloudflare, dweb.link, nftstorage.link), and a 5–10 second timeout.
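The retry/fallback logic can be sketched as follows. The gateway list and backoff constants are illustrative defaults, and `AbortSignal.timeout` plus global `fetch` assume Node 18+:

```typescript
// Illustrative gateway list; tune per deployment and gateway availability.
const GATEWAYS = [
  "https://cloudflare-ipfs.com/ipfs/",
  "https://dweb.link/ipfs/",
  "https://nftstorage.link/ipfs/",
];

// Exponential backoff with a cap: 500ms, 1s, 2s, 4s, ... up to capMs.
function backoffMs(attempt: number, baseMs = 500, capMs = 8_000): number {
  return Math.min(baseMs * 2 ** attempt, capMs);
}

// Normalize ipfs:// URIs to candidate gateway URLs; pass HTTP URLs through.
function gatewayUrls(uri: string): string[] {
  if (!uri.startsWith("ipfs://")) return [uri];
  const path = uri.slice("ipfs://".length).replace(/^ipfs\//, "");
  return GATEWAYS.map((g) => g + path);
}

// Try each gateway in turn, with a per-request timeout and backoff between rounds.
async function fetchMetadata(uri: string, retries = 3, timeoutMs = 7_500): Promise<unknown> {
  for (let attempt = 0; attempt < retries; attempt++) {
    for (const url of gatewayUrls(uri)) {
      try {
        const res = await fetch(url, { signal: AbortSignal.timeout(timeoutMs) });
        if (res.ok) return await res.json();
      } catch {
        // timeout or network error: fall through to the next gateway
      }
    }
    await new Promise((r) => setTimeout(r, backoffMs(attempt)));
  }
  throw new Error(`metadata fetch failed: ${uri}`);
}
```

Rotating the gateway order per request also helps avoid hammering a single gateway during a large backfill.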
Frontend
Next.js 14 with App Router. Key pages:
- Collection overview: floor chart (Recharts/TradingView lightweight), volume bars, holder distribution pie
- Token detail: rarity rank, trait comparison, price history, similar sales
- Wallet analytics: portfolio valuation, P&L by collection, unrealized gains
- Market trends: trending by volume/floor change, new mints heatmap
For charts with large data volumes — TradingView Lightweight Charts (canvas rendering) is faster than Recharts on 10k+ points.
What's Hard and Takes Time
Historical sync — indexing all NFT transactions from 2017 to today takes weeks even on a fast node. Use snapshots from Dune/Reservoir for bootstrap, then catch up with live data.
Multi-chain — supporting Ethereum + Polygon + Base + Solana requires four different indexers with different ABIs and event structures. Solana is especially painful: no EVM, and events are base64-encoded program logs.
Accurate pricing — aggregating sales correctly is hard. You need to account for the currency (ETH, WETH, USDC, BLUR token), convert to USD at the historical rate at the moment of sale, and exclude wash trades. Each step adds a source of error.
Realistic timeline for MVP (one network, basic metrics, simple UI): 6–8 weeks. Full multi-chain platform with wash trade detection and portfolio analytics: 3–4 months.