Speech format vs size