Implementing Face Tracking and Recognition in AR Applications
Face tracking in mobile applications is mature technology with clearly defined capabilities and limitations. ARKit with the TrueDepth camera produces a face depth map with millimeter precision. ARCore Augmented Faces and MediaPipe run on the RGB camera and track fast motion slightly worse, but work on any device. The choice depends on the task — and it's easy to pick the wrong stack.
ARKit Face Tracking: What Exactly You Get
ARFaceTrackingConfiguration requires iPhone X or newer (TrueDepth front camera). Returns ARFaceAnchor:
geometry — ARFaceGeometry with 1220 vertices and 2304 triangles. The face mesh is in real-world scale (meters) and updates ~30 times per second. Each vertex has a fixed index, so you can address the nose tip specifically (vertex ~9), the mouth corners (~37, ~45), the pupils.
blendShapes — a dictionary of 52 blend shape coefficients: browDownLeft, eyeBlinkLeft, jawOpen, mouthSmileLeft, etc. Each is a Float from 0 to 1. This is the foundation for face-driven animation (morph targets, 3D avatars) and expression recognition.
leftEyeTransform, rightEyeTransform — position and orientation of each eye, for eye tracking and gaze direction.
func session(_ session: ARSession, didUpdate anchors: [ARAnchor]) {
    guard let faceAnchor = anchors.first as? ARFaceAnchor else { return }
    let blinkLeft = faceAnchor.blendShapes[.eyeBlinkLeft]?.floatValue ?? 0
    let jawOpen = faceAnchor.blendShapes[.jawOpen]?.floatValue ?? 0
    // Control UI with blink/mouth
    if blinkLeft > 0.7 { triggerAction() }
}
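The eye transforms can drive a simple gaze estimate. A sketch, assuming ARKit's convention that each eye transform is relative to the face anchor with -Z as the forward direction (the helper name is hypothetical; ARFaceAnchor's built-in lookAtPoint is the simpler alternative):

```swift
import ARKit
import simd

// Estimate the averaged gaze direction of both eyes, in world space.
// Hypothetical helper; the -Z forward convention follows ARKit.
func gazeDirection(for faceAnchor: ARFaceAnchor) -> simd_float3 {
    // Eye transforms are expressed relative to the face anchor.
    let leftEyeWorld  = faceAnchor.transform * faceAnchor.leftEyeTransform
    let rightEyeWorld = faceAnchor.transform * faceAnchor.rightEyeTransform
    // The third column is the eye's local Z axis; negate it for "forward".
    let leftForward  = -simd_float3(leftEyeWorld.columns.2.x,
                                    leftEyeWorld.columns.2.y,
                                    leftEyeWorld.columns.2.z)
    let rightForward = -simd_float3(rightEyeWorld.columns.2.x,
                                    rightEyeWorld.columns.2.y,
                                    rightEyeWorld.columns.2.z)
    // Average the two eyes and normalize.
    return simd_normalize(leftForward + rightForward)
}
```
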
ARCore AugmentedFace and MediaPipe
ARCore Augmented Faces (Android only, not supported on iOS): a 468-point face mesh from an ML model on the RGB camera. AugmentedFace.RegionType — NOSE_TIP, FOREHEAD_RIGHT, FOREHEAD_LEFT — gives key points. Fewer points than ARKit and no depth map, but it works on the vast majority of modern Android devices without a special sensor.
MediaPipe Face Landmarker Task — the cross-platform option (iOS, Android, Web). 478 points (the 468-point mesh plus 10 iris landmarks). Works via VisionImage / MPImage. Open source, free. For tasks without strict realtime requirements (photo analysis, static filters) it's an excellent choice. For a 30fps live camera it needs a device with a Neural Engine (iPhone) or a modern Android ML accelerator.
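On iOS the flow looks roughly like this. A sketch following the MediaPipe Tasks Vision Swift API; the model file name and option names are assumptions — verify against the current MediaPipe docs for your version:

```swift
import MediaPipeTasksVision
import UIKit

// One-shot landmark detection on a still image.
// "face_landmarker.task" is an assumed bundled model file name.
func detectLandmarks(in image: UIImage) throws -> FaceLandmarkerResult {
    let options = FaceLandmarkerOptions()
    options.baseOptions.modelAssetPath = Bundle.main.path(
        forResource: "face_landmarker", ofType: "task")!
    options.runningMode = .image   // use .liveStream for a camera feed
    let landmarker = try FaceLandmarker(options: options)
    return try landmarker.detect(image: try MPImage(uiImage: image))
}
```
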
Classification and Expression Recognition
Basic tasks on blendShapes without ML:
- Smile: mouthSmileLeft + mouthSmileRight > 0.5
- Wink: eyeBlinkLeft > 0.85 with eyeBlinkRight < 0.3
- Surprise: eyeWideLeft + eyeWideRight > 1.2 with browInnerUp > 0.5
- Mouth open: jawOpen > 0.4
This works for simple triggers — game mechanics, hands-free interface control. For emotion recognition (joy, sadness, anger) you need ML classification on top of blendShapes. Create ML can train a classifier (e.g., a tabular MLClassifier) on recorded blendShape feature vectors.
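The threshold triggers above can be wrapped in a plain function with no ML dependency. A minimal sketch — the type and function names are illustrative, the cutoffs are the ones listed:

```swift
import Foundation

enum Expression: String {
    case smile, wink, surprise, mouthOpen, neutral
}

// Classify an expression from blendshape coefficients (each 0...1).
// Keys mirror ARKit's blendShapes names; missing keys default to 0.
func classify(_ shapes: [String: Float]) -> Expression {
    func v(_ key: String) -> Float { shapes[key] ?? 0 }
    if v("mouthSmileLeft") + v("mouthSmileRight") > 0.5 { return .smile }
    if v("eyeBlinkLeft") > 0.85 && v("eyeBlinkRight") < 0.3 { return .wink }
    if v("eyeWideLeft") + v("eyeWideRight") > 1.2
        && v("browInnerUp") > 0.5 { return .surprise }
    if v("jawOpen") > 0.4 { return .mouthOpen }
    return .neutral
}
```

Per-frame noise makes raw thresholds flicker; in practice you'd require the condition to hold for a few consecutive frames before firing a trigger.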
Recognizing a Specific Person
Face recognition (identity verification) is a fundamentally different task, not covered by face tracking. For identification: Vision framework VNDetectFaceRectanglesRequest to locate and crop the face → face embedding via a CoreML model (ArcFace, FaceNet, or similar). Compare embedding vectors against a database.
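The embedding comparison itself is typically cosine similarity against an enrollment database. A minimal sketch — the function names and the 0.5 acceptance threshold are assumptions; tune the threshold per model:

```swift
import Foundation

// Cosine similarity between two embeddings of equal length.
func cosineSimilarity(_ a: [Float], _ b: [Float]) -> Float {
    precondition(a.count == b.count)
    var dot: Float = 0, normA: Float = 0, normB: Float = 0
    for i in a.indices {
        dot   += a[i] * b[i]
        normA += a[i] * a[i]
        normB += b[i] * b[i]
    }
    return dot / (sqrt(normA) * sqrt(normB))
}

// Return the best-matching identity above the threshold, if any.
func identify(_ probe: [Float], in database: [String: [Float]],
              threshold: Float = 0.5) -> String? {
    let best = database
        .map { ($0.key, cosineSimilarity(probe, $0.value)) }
        .max { $0.1 < $1.1 }
    guard let (name, score) = best, score >= threshold else { return nil }
    return name
}
```
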
For verifying the device owner there's LocalAuthentication with LAContext.evaluatePolicy(.deviceOwnerAuthenticationWithBiometrics) for Face ID. This isn't an SDK for your own matching logic — it's system biometry. Using system Face ID for user verification in an app is simpler and more secure than building your own.
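The system path is short. A sketch with a hypothetical wrapper function and prompt string:

```swift
import LocalAuthentication

// Verify the device owner with system biometrics (Face ID / Touch ID).
func authenticateUser(completion: @escaping (Bool) -> Void) {
    let context = LAContext()
    var error: NSError?
    guard context.canEvaluatePolicy(.deviceOwnerAuthenticationWithBiometrics,
                                    error: &error) else {
        completion(false)   // biometrics unavailable or not enrolled
        return
    }
    context.evaluatePolicy(.deviceOwnerAuthenticationWithBiometrics,
                           localizedReason: "Confirm it's you") { success, _ in
        DispatchQueue.main.async { completion(success) }
    }
}
```

Face ID also requires the NSFaceIDUsageDescription key in Info.plist.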
Latency and Performance
ARKit face tracking + 3D mask + environment occlusion on iPhone 12 — ~8–12% CPU, ~30% GPU at 60fps UI. On iPhone XR (A12) consumption is higher, sometimes with thermal throttling during long sessions. Monitor via os_signpost + Instruments → Metal System Trace.
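To make per-frame cost visible in Instruments, wrap the frame-processing work in signpost intervals. A sketch; the subsystem string and interval name are placeholders:

```swift
import os.signpost

// Mark frame-processing intervals so they appear in Instruments.
let faceLog = OSLog(subsystem: "com.example.faceAR",   // placeholder ID
                    category: .pointsOfInterest)

func processFrame(_ work: () -> Void) {
    let id = OSSignpostID(log: faceLog)
    os_signpost(.begin, log: faceLog, name: "FaceFrame", signpostID: id)
    work()
    os_signpost(.end, log: faceLog, name: "FaceFrame", signpostID: id)
}
```
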
Two cameras simultaneously (front + rear) aren't available through ARFaceTrackingConfiguration alone, but on A12+ devices ARWorldTrackingConfiguration.userFaceTrackingEnabled (iOS 13+) tracks the user's face while the rear camera runs world tracking. For selfie AR, check ARFaceTrackingConfiguration.supportedVideoFormats to verify available resolutions.
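Picking a format from that list is a one-liner. A sketch (the helper name is illustrative; selection criteria — resolution vs. frame rate — depend on your use case):

```swift
import ARKit

// Choose the highest-resolution face-tracking video format
// the current device supports.
func bestFaceTrackingFormat() -> ARConfiguration.VideoFormat? {
    ARFaceTrackingConfiguration.supportedVideoFormats
        .max { $0.imageResolution.width < $1.imageResolution.width }
}
```
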
Timeline
Face tracking with basic blendShape triggers (game mechanics, hands-free control) — 1–2 weeks. Face tracking + 3D mask/accessories + video recording — 2–4 weeks. Expression recognition via ML classifier — plus 2–3 weeks. Cost calculated individually.