Technology Face-Off: Augmented Reality vs Mobile Image

AR vs. MIR vs. Mobile Tagging

Three new technologies are competing for attention on today’s mobile phones: Augmented Reality (AR), Mobile Image Recognition (MIR) and Mobile Tagging (MT). This briefing gives a short introduction to all three, positions them against each other and illustrates how they can co-exist.

Augmented Reality (AR)
AR uses the current view of the real world visible through your phone camera (front end) to display online content (back end) linked to specific objects within that view. It does this by using GPS co-ordinates to establish a ‘start position’ for where you are and your phone’s built-in digital compass to determine what you are looking at. The combination of position and orientation allows the AR software to resolve to back end content that adds some perspective to what you are looking at. AR can also layer different ‘virtual’ views over your current ‘reality’, updating them as you change your position or the orientation of your phone. This offers an ever-changing range of user-selectable perspectives that augment your reality with a wide variety of online information.
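The position-plus-orientation lookup described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `PointOfInterest` record, the 60-degree field of view and the 500 m range are hypothetical choices, not details of any particular AR browser.

```python
import math
from dataclasses import dataclass


@dataclass
class PointOfInterest:
    name: str
    lat: float        # degrees
    lon: float        # degrees
    content_url: str  # hypothetical link to back end content


def bearing_deg(lat1, lon1, lat2, lon2):
    """Initial great-circle bearing from point 1 to point 2 (0 = north, 90 = east)."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dlon = math.radians(lon2 - lon1)
    y = math.sin(dlon) * math.cos(phi2)
    x = math.cos(phi1) * math.sin(phi2) - math.sin(phi1) * math.cos(phi2) * math.cos(dlon)
    return math.degrees(math.atan2(y, x)) % 360.0


def distance_m(lat1, lon1, lat2, lon2):
    """Haversine distance in metres on a spherical Earth."""
    radius = 6371000.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlon = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlon / 2) ** 2
    return 2 * radius * math.asin(math.sqrt(a))


def visible_pois(lat, lon, heading_deg, pois, fov_deg=60.0, max_range_m=500.0):
    """Return (poi, distance, angular offset) for POIs inside the camera's view, nearest first."""
    visible = []
    for poi in pois:
        dist = distance_m(lat, lon, poi.lat, poi.lon)
        if dist > max_range_m:
            continue
        # Signed offset between the compass heading and the POI bearing, in [-180, 180).
        offset = (bearing_deg(lat, lon, poi.lat, poi.lon) - heading_deg + 180.0) % 360.0 - 180.0
        if abs(offset) <= fov_deg / 2:
            visible.append((poi, dist, offset))
    return sorted(visible, key=lambda item: item[1])
```

The GPS fix supplies `lat` and `lon`, the compass supplies `heading_deg`, and the angular offset of each result could be used to place its label horizontally on the camera view.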

Mobile Image Recognition (MIR)

MIR matches an image that you focus on/snap with your phone camera (front-end) to content that is linked to that image (back-end). When you focus on or snap the image it is sent as a picture message to a back end application that matches the snapped image to a pre-existing image in a database in order to deliver content linked to that image back to the mobile phone.
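As a rough illustration of the back end matching step, the sketch below compares a snapped image against a small database using an average hash. The average hash is a deliberately simple stand-in chosen for brevity, not the method any real MIR service is known to use, and the database layout, the `content_url` field and the `max_distance` threshold are all assumptions. Images are represented as small grayscale grids (lists of rows) that have already been downscaled.

```python
def average_hash(pixels):
    """Return a tuple of bits: 1 where a pixel is brighter than the image mean."""
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    return tuple(1 if p > mean else 0 for p in flat)


def hamming(a, b):
    """Count the positions at which two equal-length hashes differ."""
    return sum(x != y for x, y in zip(a, b))


def match_image(snapped, database, max_distance=5):
    """Return the content URL of the closest stored image, or None if nothing is close enough.

    `database` is a list of dicts with a precomputed "hash" and a "content_url"
    (both hypothetical field names).
    """
    snapped_hash = average_hash(snapped)
    best = None
    for entry in database:
        d = hamming(snapped_hash, entry["hash"])
        if best is None or d < best[0]:
            best = (d, entry)
    if best is not None and best[0] <= max_distance:
        return best[1]["content_url"]
    return None
```

Because the hash depends only on which pixels are brighter than the mean, a snap that is uniformly lighter or darker than the stored image still matches, while an image that was never added to the database returns `None`.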

Mobile Tagging (MT)
MT matches a unique image in a specific tag format (front end) that you snap with your phone camera to display content that is linked to that image (back end). The tag snap could simply resolve to a website URL or a piece of text that is delivered back to the mobile phone user via their browser, or it could link to a purpose-built mobile page or site that is ‘pointed’ to via a pointer in a back end database.
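The three outcomes described above (a direct URL, a piece of text, or a pointer held in a back end database) can be sketched as a small resolver. It assumes the tag has already been decoded to a string by a reader application; the `POINTER_TABLE` contents and the tag identifier format are hypothetical.

```python
from urllib.parse import urlparse

# Hypothetical back end table mapping tag identifiers to purpose-built mobile pages.
POINTER_TABLE = {
    "TAG-0001": "https://example.com/m/winery/stop-1",
}


def resolve_tag(payload, pointers=POINTER_TABLE):
    """Classify a decoded tag payload and return (kind, value)."""
    payload = payload.strip()
    parsed = urlparse(payload)
    if parsed.scheme in ("http", "https") and parsed.netloc:
        return ("url", payload)            # the tag encodes the URL directly
    if payload in pointers:
        return ("url", pointers[payload])  # the tag is a pointer into the back end
    return ("text", payload)               # otherwise treat it as plain text
```

Keeping the pointer in the back end means the destination page can be changed later without reprinting the tag, whereas a URL encoded directly in the tag is fixed once it has been deployed.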

Differentiating
AR is location and orientation centric. It is concerned with linking content to your current real-world context, as represented by your phone’s position and orientation. AR can present different perspectives on your current locational context. AR is great for:

• Finding stuff nearby where you are now

• Learning more about what you see around you

• Connecting to or finding out about people around you

A drawback with AR is that the content layers need to be linked to positioning co-ordinates in a back end database and will need to be regularly updated.

MIR is image centric. It is concerned with recognizing an image – wherever and on whatever it is located – in order to deliver content linked to that specific image. MIR could also use location-based data to vary the content delivered depending on location. MIR is great for:

• Leveraging existing images (e.g. book/cd/dvd covers)

• Linking brand logos in print/on posters to online content

• Associating a face with online content

A drawback with MIR is that the image needs to have been processed and added to a back end database before it can be recognized and resolved to by the MIR-enabled phone camera.

MT is content centric. It is concerned with recognizing a specific kind of image - a tag - located on or next to a specific object or place or printed item (e.g. an advertisement) in order to deliver content linked to that tag. MT doesn’t care about location since this can be ‘pre-coded’ into the tag and in any case tagged objects are often not location-specific. MT is great for:

• Putting on physical items to link them to online content

• Linking a specific thing to specific online content

• Linking content in printed matter to online content

A drawback with MT is that the tag needs to be created and then deployed (i.e. stuck on, printed on) to the physical object.

Use Case: A Winery

Imagine you run a winery and need to understand how you can leverage these technologies to reach a new audience via their mobile phones: What could you do? Let’s say your rationale for leveraging these technologies is that you want to engage consumers with your wine and enhance the visitor experience of your winery tours.

Engage Wine Consumers
Here we are focused on the wine bottle and providing more information about the wine or encouraging the user to buy more.

AR would be a poor choice since the locational context of a wine bottle is largely irrelevant. However, the wine bottle’s ‘front’ label could be used by MIR to deliver useful content when snapped, as could a tag printed on the wine bottle’s ‘back’ label. Both MIR and MT could enhance the experience of shopping for the wine (you could snap the label or tag in the store) and of drinking the wine (you could snap the label or tag before opening the bottle for serving/food tips), and both could make it easier to sell more wine or cross-sell specific wine-related products (e.g. wine racks, openers or food).

Enhance Winery Tours
Here we are focused on enhancing the visitor’s experience of the winery while on a guided or self-guided tour. MIR is probably the least useful choice here since there will typically be few pre-existing images around the winery to recognize and link content to. MT is the next best choice, especially for a self-guided tour, as tags could be located at various ‘stops’ on a winery ‘trail’ so that they can be snapped to resolve to useful content about what you are looking at or where you are now. But AR is probably the best choice because a winery-specific content layer can be developed so that as visitors wander around, whether on a guided or a self-guided tour, they can augment the reality of what they see with the content delivered by AR.

In other words, AR, MIR and MT could all be used to your advantage.

Dependencies
All 3 technologies have dependencies:

Shared

• A camera-enabled mobile phone

• A fast internet connection available to ‘invoke’ the service

• An internet browser on the phone

• An AR/MIR/MT tag reader application installed on the phone

AR

• Location-based services on the mobile

• A built-in digital compass on the mobile

• A back end database with a content layer linked to the precise co-ordinates and orientation of a given location.

MIR

• MMS enabled on your mobile phone

• A back end database storing ‘pre-processed’ images that are used to match to the snapped image and linked to content.

• Optional - Location-based services on the mobile

MT

• Use of a specific tag format (e.g. QR Code, Datamatrix, MS Tag) to create the tag image to deploy

• Optional - Location-based services on the mobile
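The phone-side dependencies listed above can be expressed as a simple checklist. The sketch below is illustrative only: the capability labels are hypothetical names for the bullet points, the optional items are left out, and the back end requirements (content layers, image databases, tag formats) are not modelled.

```python
# Capabilities every technology needs on the phone (hypothetical labels).
SHARED = {"camera", "internet", "browser", "reader_app"}

# Required phone capabilities per technology, following the lists above.
REQUIRED = {
    "AR": SHARED | {"location_services", "compass"},
    "MIR": SHARED | {"mms"},
    "MT": SHARED,
}


def missing_capabilities(technology, phone_capabilities):
    """Return the sorted list of capabilities the phone lacks for a technology."""
    return sorted(REQUIRED[technology] - set(phone_capabilities))


def usable_technologies(phone_capabilities):
    """Return the technologies whose phone-side requirements are all met."""
    return [t for t in REQUIRED if not missing_capabilities(t, phone_capabilities)]
```

For example, a camera phone with a browser, a reader application, an internet connection and MMS but no GPS or compass would be reported as usable for MIR and MT but not for AR.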

All three also require some kind of mobile content management system (MCMS) for individuals/businesses to manage the AR content layer or the content linked to by an image or the content linked to by a tag.

Some Links...

AR -  layar.com

MIR - kooaba.com

MT - vizitag.com

More Stories By Stewart McKie

Stewart McKie has 25 years of IT industry experience. His education includes an MSc in Organization Consulting and an MA in Screenwriting. He was the Technology Editor of Business Finance magazine from 1995 to 2000 and also wrote regular features for Intelligent Enterprise magazine. He is the author of six books on accounting software and over 50 technology white papers. His current focus is his screenwriting 2.0 app, Scenepad, and his supply-chain auditing app. He has managed many ERP selections and implementations of SunSystems all over the world. He is currently engaged as the Implementation Oversight consultant for a global AX2009 rollout for a manufacturing client and as the selection consultant for a pan-European ERP solution.

Most Recent Comments
Jamie Thompson 09/03/09 11:01:00 AM EDT

Mark - Pongr just did a blog post on exactly what you're talking about - location awareness AND image recognition. Although, from what we can tell, you are correct so far. Here it is.

Stewart - great article!

Mark Power 09/02/09 04:07:00 PM EDT

Very helpful article, Stewart, thanks!

At the risk of sounding completely clueless (as ever!), I'm wondering... I've seen example video demos of AR where the user points their camera at a building and gets overlayed images of archive photos of it. To me that sounds like a bit of both - location awareness AND image recognition. Presumably that's not the case though and it's just the former tech at work there...?

I just want to be sure I'm thinking it through correctly :)
