Combining multi-agent technology with a reinforcement learning model, a computer simulation model of travel modes and choice of departure time of commuters in peak hours is established.In a simulation, travel choice behaviors of commuters are studied with the consideration of impacts of vehicle restriction policies, and the formation of commuting equilibrium in peak periods is also reproduced.Based on simulation results, the effects of different measures for improving public transportations are analyzed.The results show that the number of commuters by bus increases by 18% after the implementation of restriction policies, which eases congestions in peak periods to a certain extent.Meanwhile, the probabilities that commuters travel by bus in unrestricted days become smaller, which means the effects of adopting restriction policies exclusively are fairly limited.Under the influences of restriction policies, if departure frequencies of public transport increase, the number of commuters travel by bus increases by 17.5%, and drivers′ waiting time in congestions decreases by 85%, which can effectively improve the traffic situations.Compared with that, reducing ticket price of public transport is less effective.The multi-agent approach applied in this study shows the richness in individual behaviors which can be realized intuitively and conveniently.It also has advantages in describing interactions between individuals and traffic systems, which provides an effective way to explore formation and evolution of complicated traffic phenomena.