Yes. they certainly could drape DRGs (topo maps) over the terrain, and probably will in the near future instead of the imagery. However, when they're pulling the data over the internet, any rendered system won't be as fast as a 2d one because it first has to calculate what data it needs to send, then send it and then render it. Once it's cached on your local computer it's fairly rapid to display until it needs a new dataset.
2d maps are relatively simple to do by comparison to 3d. All you're looking at on any of the online mapping programs is a flat picture of the map that's put up on a server temporarily with the extent you need and using some fancy clientside scripting to tell the computer on the other end where you clicked on the map so it can generate a new picture. It doesn't require anything other than the transmission of a picture to your computer. Google earth is a whole different can of beans because most of the number crunching is happening on your local machine instead of an array of servers that are directly connected to the datasets on at least a Gigabit network.
Bookmarks