Wirevolution

Mobile Unified Communications

Subscribe!

Archive for the ‘HD Voice’ Category

Orange HD Voice Demo on Sky TV

Friday, January 28th, 2011

Martin Stanford of Sky TV’s “Tech Talk” does a video blog demonstrating HD Voice on a Nokia phone. There is no latency, indicating some kind of post-processing of the video, but it’s still a nicely done and illustrative demo of HD Voice.

ITExpo West — Achieving HD Voice On Smartphones

Friday, October 1st, 2010

I will be moderating a panel discussion at ITExpo West on Tuesday 5th October at 11:30 am in room 306B: “Achieving HD Voice On Smartphones.”

Here’s the session description:

The communications market has been evolving to fixed high definition voice services for some time now, and nearly every desktop phone manufacturer is including support for G.722 and other codecs now. Why? Because HD voice makes the entire communications experience a much better one than we are used to.

But what does it mean for the wireless industry? When will wireless communications become part of the HD revolution? How will handset vendors, network equipment providers, and service providers have to adapt their current technologies in order to deliver wireless HD voice? How will HD impact service delivery? What are the business models around mobile HD voice?

This session will answer these questions and more, discussing both the technology and business aspects of bringing HD into the mobile space.

The panelists are:

This is a deeply experienced panel; each of the panelists is a world-class expert in his field. We can expect a highly informative session, so come armed with your toughest questions.

ITExpo West — Building Better HD Video Conferencing & Collaboration Systems

Friday, October 1st, 2010

I will be moderating a session at ITExpo West on Tuesday 5th October at 9:30 am: “Building Better HD Video Conferencing & Collaboration Systems,” will be held in room 306A.

Here’s the session description:

Visual communications are becoming more and more commonplace. As networks improve to support video more effectively, the moment is right for broad market adoption of video conferencing and collaboration systems.

Delivering high quality video streams requires expertise in both networks and audio/video codec technology. Often, however, audio quality gets ignored, despite it being more important to efficient communication than the video component. Intelligibility is the key metric here, where wideband audio and voice quality enhancement algorithms can greatly improve the quality of experience.

This session will cover both audio and video aspects of today’s conferencing systems, and the various criteria that are used to evaluate them, including round-trip delay, lip-sync, smooth motion, bit-rate required, visual artifacts and network traversal – and of course pure audio quality. The emphasis will be on sharing best practices for building and deploying high-definition conferencing systems.

The panelists are:

  • James Awad, Marketing Product Manager, Octasic
  • Amir Zmora, VP Products and Marketing, RADVISION
  • Andy Singleton, Product Manager, MASERGY

These panelists cover the complete technology stack from chips (Octasic), to equipment (Radvison) to network services (Masergy), so please bring your questions about any technical aspect of video conferencing systems.

ITExpo West — The State of VoIP Peering

Friday, October 1st, 2010

I will be moderating a session at ITExpo West on Monday 4th October at 2:15 pm: “The State of VoIP Peering,” will be held in room 304C.

Here’s the session description:

VoIP is a fact – it is here, and it is here to stay. That fact is undeniable. To date, the cost savings associated with VoIP have largely been enough to drive adoption. However, the true benefits of VoIP will only be realized through the continued growth of peering, which will keep calls on IP backbones rather than moving them onto the PSTN. Not only will increased peering continue to reduce costs, it will increase voice call quality – HD voice, for instance, can only be delivered on all-IP calls.

Of course, while there are benefits to peering, traditional carriers have traditionally not taken kindly to losing their PSTN traffic, for which they are able to bill by the minute. But, as the adoption of IP communications continues to increase – and of course the debate continues over when we will witness the true obsolescence of the PSTN – carriers will have little choice but to engage in peering relationships.

This session will offer an market update on the status of VoIP peering and its growth, as well as trends and technologies that will drive its growth going forward, including wideband audio and video calling.

The panelists are:

This is shaping up to be a fascinating session. Rico can tell us about the hardware technologies that are enabling IP end-to-end for phone calls, and Mark and Grant will give us a real-world assessment of the state of deployment, the motivations of the early adopters, and the likely fate of the PSTN.

HD Voice, Peering and ENUM

Friday, May 7th, 2010

The most convenient route between telephone service providers is through the PSTN, since you can’t offer phone service without connecting to it. Because of this convenience telephone service providers tend to consider PSTN connectivity adequate, and don’t take the additional step of delivering IP connectivity. This is unfortunate because it inhibits the spread of high quality wideband (HD Voice) phone calls. For HD voice to happen, the two endpoints must be connected by an all-IP path, without the media stream crossing into the PSTN.

For example, OnSIP is my voice service provider. Any calls I make to another OnSIP subscriber complete in HD Voice (G.722 codec), because I have provisioned my phones to prefer this codec. Calls I make to phone numbers (E.164 numbers) that don’t belong to OnSIP complete in narrowband (G.711 codec), because OnSIP has to route them over the PSTN. If OnSIP was able to use an IP address for these calls instead of an E.164 number, it could avoid the PSTN and keep the call in G.722.

Xconnect has just announced an HD Voice Peering initiative, where multiple voice service providers share their numbers in a common directory called an ENUM directory. When a subscriber places a call, the service provider looks up the destination number in the ENUM directory; if is there, it returns a SIP address (URI) to substitute for the phone number, and the call can complete without going over the PSTN. About half the participants in the Xconnect trial go a step further than ENUM pooling: they interconnect (“peer”) their networks directly through an Xconnect router, so the traffic doesn’t need to traverse the public Internet. [See correction in comments below]

There are other voice peering services that support this kind of HD connection, notably the VPF (Voice Peering Fabric). The VPF has an ENUM directory, but as the name suggests, it does not offer ENUM-only service; all the member companies interconnect their networks on a VPF router.

Some experts maintain that for business-grade call quality, it is essential to peer networks rather than route over the public Internet. Packets that traverse the public Internet are prone to delay and loss, while properly peered networks deliver packets quickly and reliably. In my experience, this has not been an issue. My access to OnSIP and to Vonage is over the public Internet, and I have never had any quality issues with either provider. From this I am inclined to conclude that explicit peering of voice networks is overkill, and that if you have a VoIP connection all that is needed for HD voice communication is to list your phone number in an ENUM directory. Presumably the voice service providers in Xconnect’s trial that are not peering share this opinion.

Xconnect’s ENUM directory is enormous, partly because it is pooled with Pathfinder – the GSMA ENUM directory administered by Neustar. Xconnect’s ENUM directory had over 120 million numbers in it as of 2007.

Xconnect and the VPN only add to their ENUM directories the numbers owned by their members. But even if you are not a customer of one of their members, you can still list your number in an ENUM directory, e164.org. This way, anybody who checks for your number in the directory can route the call over the Internet. Calls made this way don’t need to use SIP trunks, and they can complete in HD voice.

If you happen to have an Asterisk PBX, you can easily provision it to check in a list of ENUM directories before it places a call.

HD Voice – state of deployment

Monday, January 11th, 2010

At the HD Voice Summit in Las Vegas last week, Alan Percy of AudioCodes gave a presentation of the state of deployment of HD Voice, citing three levels of deployment: announced interest, trials and service deployment.

Percy’s take was that in the “Crossing the Chasm” technology adoption lifecycle, HD Voice is right at the chasm.

Here is his list, augmented with input from Jan Linden of GIPS,Tom Lemaire of FT/Orange, Doug Mohney of HD Voice News and Dave Erickson of Wyde Voice:

Category Company Stage
PC VoIP Skype >500 m downloads
QQ (China) >500 m downloads
Gizmo5 (now Google)
Wireline telco France Telecom 500K HD users
British Telecom Trials
FT/Orange Spain Deployed 1Q09
FT/Orange Poland Deploys 1Q10
Mobile Orange (Moldova) Production
Orange (UK) Deploys 3Q10
Orange (Belgium) Deploys 2010
CLEC VoIP Alteva Production
SimpleSignal Production
Ooma 25K HD users
8×8 >70K HD users
OnSIP Production
Phone.com Trials
US MSOs CableVision/Lightpath Limited Trials
Conferencing ZipDX Production
ClearOne Production
Citrix Production
FreeConferenceCall.com Production
Global Crossing Limited Trials

The main codecs in each of these deployments are: Skype:SILK; QQ, Citrix, Freeconferencecall:iSAC; mobile:AMR-WB; all others: G.722.

Alan pointed out the conspicuous lack of involvement of the cable companies (MSOs), even though Cable Labs has done a good job of creating HD specifications for them.

HD Voice Numbers

Friday, January 8th, 2010

50% of consumers say they would change their telephone service provider to get better sound quality, according to Tom Lemaire, Sr. Director of Business Development at Orange/France Telecom North America, speaking at the CES HD Voice Summit this week (Orange/France Telecom has the largest deployment of HD Voice of any traditional telco). Rich Buchanan, Chief Marketing Officer at Ooma, said at the same session that his surveys show that 65% of consumers would change their provider to get better voice quality.

Bearing in mind that we know from observation that consumers value both mobility and price above call quality, these survey numbers fall into the “interesting if true” category.

Lemaire and Buchanan also said that their logs show that the average call in HD lasts longer than the average narrowband call, though they didn’t give numbers.

VoIP on the cellular data channel

Thursday, September 17th, 2009

In a recent letter to the FCC, AT&T said that it had no objection to VoIP applications on the iPhone that communicate over the Wi-Fi connection. It furthermore said:

Consistent with this approach, we plan to take a fresh look at possibly authorizing VoIP capabilities on the iPhone for use on AT&T’s 3G network.

So why would anybody want to do VoIP on the cellular data channel, when there is a cellular voice channel already? Wouldn’t voice on the data channel cost more? And since the voice channel is optimized for voice and the data channel isn’t, wouldn’t voice on the data channel sound even worse than cellular voice already does?

Let’s look at the “why bother?” question first. There are actually at least four reasons you might want to do voice on the cellular data channel:

  1. To save money. If your voice plan has some expensive types of call (for example international calls) you may want to use VoIP on the data channel for toll by-pass. The alternative to this is to use the voice channel to call a local access number for an international toll by-pass service (like RebTel.)
  2. To get better sound quality: the cellular voice codecs are very low bandwidth and sound horrible. You can choose which codec to run over the data network and even go wideband. At IT Expo West a couple of weeks ago David Frankel of ZipDX demoed a wideband voice call on his laptop going through a Sprint Wireless Data Card. The audio quality was excellent.
  3. To get additional service features: companies like DiVitas offer roaming between the cellular and Wi-Fi networks that makes your cell phone act as an extension behind your corporate PBX. All these solutions currently use the cellular voice channel when out of Wi-Fi range, but if they were to go to the data channel they could offer wideband codecs and other differentiating features.
  4. For cases where there is no voice channel. In the example of David Frankel’s demo, the wireless data card doesn’t offer a voice channel, so VoIP on the data channel is the only option for a voice connection.

Moving on to the issue of cost, an iPhone unlimited data plan is $30 per month. “Unlimited” is AT&T’s euphemism for “limited to 5GB per month,” but translated to voice that’s a lot of minutes: even with IP packet overhead the bit-rate of compressed HD voice is going to be around 50K bits per second, which works out to about 13,000 minutes in 5GB. So using it for voice is unlikely to increase your bill. On the other hand, many voice plans are already effectively unlimited, what with rollover minutes, friend and family minutes, night and weekend minutes and whatnot, and you can’t get a phone without a voice plan. So for normal (non-international) use voice on the data channel is not going to reduce your bill, but it is unlikely to increase it, either.

Finally we come to the issue of whether voice sounds better on the voice channel or the data channel. The answer is, it depends on several factors, primarily the codec and the network QoS. With VoIP you can radically improve the sound quality of a call by using a wideband codec, but do impairments on the data channel nullify this benefit?

Technically, the answer is yes. The cellular data channel is not engineered for low latency. Variable delays are introduced by network routing decisions and by router queuing decisions. Latencies in the hundreds of milliseconds are not unusual. This will change with the advent of LTE, where the latencies will be of the order of 10 milliseconds. The available bandwidth is also highly variable, in contrast to the fixed bandwidth allocation of the voice channel. It can sometimes drop below what is needed for voice with even an aggressive variable rate codec.

In practice VoIP on the cellular data channel can sometimes sound much better than regular cellular voice. I mentioned above David Frankel’s demo at IT Expo West. I performed a similar experiment this morning with Michael Graves, with similarly good results. I was on a Polycom desk phone, Michael used Eyebeam on a laptop, and the codec was G.722. The latency on this call was appreciable – I estimated it at around 1 second round trip. There was also some packet loss – not bad for me, but it caused a sub-par experience for Michael. Earlier this week at Jeff Pulver’s HD Connect conference in New York, researchers from Qualcomm demoed a handset running on the Verizon network using EVRC-WB, transcoding to G.722 on Polycom and Gigaset phones in their lab in San Diego. The sound quality was excellent, but the latency was very high – I estimated it at around two seconds round trip.

The ITU addresses latency (delay) in Recommendation G.114. Delay is a problem because normal conversation depends on turn taking. Most people insert pauses of up to about 400 ms as they talk. If nobody else speaks during a pause, they continue. This means that if the one-way delay on a phone conversation is greater than 200 ms, the talker doesn’t hear an interruption within the 400 ms break, and starts talking again, causing frustrating collisions.
The ITU E-Model for call quality identifies a threshold at about 170 ms one-way at which latency becomes a problem. The E-Model also tells us that increasing latency amplifies other impairments – notably echo, which can be severe at low latencies without being a problem, but at high latencies even relatively quiet echo can severely disrupt a talker.

Some people may be able to handle long latencies better than others. Michael observed that he can get used to high latency echo after a few minutes of conversation.

HD Communications Project

Monday, August 10th, 2009

As part of the preparation for the fall HD Communications Summit, Jeff Pulver has put up a video clip promoting HD Voice for phone calls. It goes over the familiar arguments:

  • Sound quality on phone calls hasn’t improved since 1937. Since most calls are now made on cell phones, it has actually deteriorated considerably.
  • The move to VoIP has made it technically feasible to make phone calls with CD quality sound or better, yet instead VoIP calls are usually engineered to sound worse than circuit-switched calls (except in the case of Skype.)
  • Improved sound quality on phone calls yields undisputed productivity benefits, particularly when the calls involve multiple accents.
  • Voice has become a commodity service, with minimal margins for service providers, yet HD Voice offers an opportunity for differentiation and potentially improved margins.

The HD Communications Summit is part of the HD Connect Project. The HD Connect Project aims to provide a coordination point for the various companies that have an interest in propagating HD Voice. These companies include equipment and component manufacturers, software developers and service providers.

Among the initiatives of the HD Connect Project is a logo program, like the Wi-Fi Alliance logo program. The logo requirements are currently technically lax, providing an indicator of good intentions rather than certain interoperability. Here’s a draft of the new logo:

HD Connect Draft Logo

Another ingredient of the HD Connect project is the HDConnectNow.org website, billed as “the news and information place for The HD Connect Project.”

It is great that Jeff is stepping up to push HD Voice like this. With the major exception of Skype almost no phone calls are made with wideband codecs (HD Voice). Over the past few years the foundation has been laid for this to change. Several good wideband codecs are now available royalty free, and all the major business phone manufacturers sell mostly (or solely) wideband-capable phones. Residential phones aren’t there yet, but this will change rapidly: the latest DECT standards are wideband, Gigasets are already wideband-capable, and Uniden is enthusiastic about wideband, too. As the installed base of wideband-capable phones grows, wideband calling can begin to happen.

Since most dialing is still done with old-style (E.164) phone numbers, wideband calls will become common within companies before there is much uptake between companies. That will come as VoIP trunking displaces circuit-switched, and as ENUM databases are deployed and used.

GIPS webinar on Wideband Voice

Wednesday, July 15th, 2009

GIPS yesterday sponsored a webinar on HD voice in the teleconferencing and videoconferencing industries. It was introduced by Elliot Gold of Telespan Publishing. Elliot made the point that HD video will be running at 90% of video equipment sales by the end of this year, and that HD audio is at least equally important to the user experience and should be the next technology transformation in the equipment and services markets.

Dovid Coplon of GIPS gave a more technical presentation, starting with a very accessible explanation of the benefits of HD audio.

Dovid made some interesting observations. He named some users of GIPS wideband technology including Google Talk and Yahoo Messenger. He cited a May 2009 poll that showed 10% of respondents used HD audio all the time or whenever they could. To me this is a surprisingly high number, possibly reflecting a biased poll sample, or perhaps a large number of respondents that didn’t understand the question.

Dovid compared the benefits of the GIPS wideband codecs, iSAC and iPCM-WB. iSAC is a lower-bit-rate, higher complexity codec than IPCM-WB. Dovid pointed out that with IP packet overhead being so high, decreasing the bit-rate of a codec is not as useful as one might think. The implication was that the lower complexity of IPCM-WB outweighed its bandwidth disadvantage in mobile applications. He also included a codec quality comparison chart based on a MUSHRA scale. The chart predictably showed iSAC way better than any of the others, and anomalously showed Speex at 24 kbps as inferior to Speex at 16 kbps.

Dovid also echoed all the codec engineers I have talked with in emphasizing that the codec is a small piece of the audio quality puzzle, and that network and physical acoustic impairments can have a greater effect on user experience.

You can download the slides here.