If you haven't seen it, try Unity (http://www.moock.org/unity). It lets you use a library of Java classes to define rooms and then define your own data types, so you could start passing the x/y positions of avatars to the rest of the connected clients.

It's Java-based, but it lets you get rolling right off the install with chat systems etc., and with a bit of work you could get the tracking system going.
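To give a rough idea of what "passing x/y positions to the other clients" looks like, here is a minimal plain-Java sketch. It does not use the Unity API: `AvatarPosition`, `Room`, `Client`, and the comma-separated message format are all hypothetical names made up for illustration, and a real server would send the message over a socket instead of calling a method directly.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only: every class and method name here is hypothetical
// and is NOT part of the Unity library.
public class AvatarRoomSketch {

    /** A custom data type holding one avatar's position. */
    static final class AvatarPosition {
        final String clientId;
        final int x;
        final int y;

        AvatarPosition(String clientId, int x, int y) {
            this.clientId = clientId;
            this.x = x;
            this.y = y;
        }

        /** Encode as "id,x,y" (assumes the id itself contains no comma). */
        String encode() {
            return clientId + "," + x + "," + y;
        }

        /** Parse a message produced by encode(). */
        static AvatarPosition decode(String message) {
            String[] parts = message.split(",");
            if (parts.length != 3) {
                throw new IllegalArgumentException("Bad position message: " + message);
            }
            return new AvatarPosition(parts[0],
                    Integer.parseInt(parts[1]),
                    Integer.parseInt(parts[2]));
        }
    }

    /** Anything that can receive a message from the room. */
    interface Client {
        String id();
        void receive(String message);
    }

    /** A room relays each position update to everyone except the sender. */
    static final class Room {
        private final List<Client> clients = new ArrayList<>();

        void join(Client client) {
            clients.add(client);
        }

        void broadcastPosition(AvatarPosition position) {
            String message = position.encode();
            for (Client client : clients) {
                if (!client.id().equals(position.clientId)) {
                    client.receive(message);
                }
            }
        }
    }

    /** Stand-in for a connected client; it just stores what it receives. */
    static final class RecordingClient implements Client {
        private final String id;
        final List<String> inbox = new ArrayList<>();

        RecordingClient(String id) {
            this.id = id;
        }

        @Override
        public String id() {
            return id;
        }

        @Override
        public void receive(String message) {
            inbox.add(message);
        }
    }

    public static void main(String[] args) {
        Room room = new Room();
        RecordingClient alice = new RecordingClient("alice");
        RecordingClient bob = new RecordingClient("bob");
        room.join(alice);
        room.join(bob);

        // alice moves; only bob should be told about it.
        room.broadcastPosition(new AvatarPosition("alice", 120, 45));
        System.out.println("bob received: " + bob.inbox);
        System.out.println("alice received: " + alice.inbox);
    }
}
```

Each client would decode the incoming message with `AvatarPosition.decode` and move the matching avatar on screen. Skipping the sender in the broadcast is just one design choice; it saves a round trip, since the sender already knows where its own avatar is.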