Caltrain HSR Compatibility Blog: Formulation of a Service Quality Metric

06 October 2012

Formulation of a Service Quality Metric

The quantitative formulation of an overall quality metric, which can be extracted from an arbitrary timetable, is necessary to objectively answer the question “is proposed timetable A better than proposed timetable B?”

Such metrics facilitate the trade-study and optimization process of planning a new timetable, and must take into account several factors, including not just the quality of the service provided to passengers but also other factors that passengers don’t think about, such as robustness to disruption, fleet size and crew time considerations.

For today, however, we will focus exclusively on quantifying the quality of the service provided to passengers. This particular formulation proceeds in eight reasonably simple steps, pulling together earlier information on timetable metrics and demographics. It is only one example of how one might formulate a service quality metric, something that Caltrain has never explicitly done and could benefit greatly from doing as they share the pros and cons of various blended service plans. This is one way to do it; what's theirs?

Step 1: Extract trip time and wait time statistics for each origin and destination pair. By straightforward analysis of the timetable, one can figure all the possible trips between any origin station A and destination station B (including transfers) during a one-hour span during the morning peak. One can then determine (in units of time):

The average trip time between A and B (Tmean_AB)
The fastest trip time between A and B (Tmin_AB)
The average wait between trips that connect A and B (Wmean_AB)
The longest wait between trips that connect A and B (Wmax_AB)

The first two metrics measure trip time on board the train, and the next two can be used as a proxy for measuring typical wait times on the platform. The trip time and wait time figures are intrinsic to the timetable and can be extracted by a computer program.

Step 2: Compute an “effective” trip time from A to B by computing a weighted sum of the time components extracted above. This is where judgment calls start to be made. Taking into account the waiting times Wmean and Wmax is just as important as the actual trip times Tmin and Tmean, in order to properly account for the frequency of service. For example, the effective trip time could be defined as:

Teff_AB = (30% of Tmin_AB + 70% of Tmean_AB) + (20% of Wmean_AB + 15% of Wmax_AB)

The trip time term (30% of Tmin_AB + 70% of Tmean_AB) accounts for some trips being shortened by express service. The waiting time term (20% of Wmean_AB + 15% of Wmax_AB) properly penalizes long service gaps, but remains shorter than the waiting time incurred when the passenger shows up randomly, which is 50% of Wmean_AB. This lower weighting reflects the fact that passengers don’t show up randomly, but usually time their arrival at origin A for a particular trip to destination B. For example, when trips are available every 15 minutes, the waiting term works out to a quite reasonable 5 minutes. The effective trip time is a reasonably good measure of how long it will take you to get from A to B.

Step 3: Determine the “effective” speed between origin A and origin B. This is simply distance divided by time, or: V_AB = d_AB / Teff_AB where d_AB is the distance between A and B. This process is repeated for every origin and destination pair A-B, and describes not the speed of a train, but the average speed of a typical trip from A to B including waiting time, based only on the available service provided by the specific timetable being considered.

Step 4: Compute weighting by population and jobs. This is where census data enters the calculation, as it must. For the morning rush hour, since ridership consists primarily of people going from their home near A to their work near B, we calculate a potential ridership weight based on how many people live near A and how many people work near B. This simply reflects that if a lot of people live near A and work near B, it is more important to provide fast service between A and B than between other station pairs where fewer people and jobs are located.

The “home weight” Whome_A of origin station A is a simple gravity sum (1/r squared law) of the residential population, taken from the 2010 census, as described previously in greater detail. Each person is divided by the square of how far they live from station A, to reflect that people who live further away from the station are less likely to use it. To prevent over-counting people who live very close to the station (where the 1/r squared term diverges), anyone living closer than ¼ mile from the station is considered to live ¼ mile away. The resulting weights are shown at left, in orange.

Similarly, the “work weight” Wwork_B of destination station B is a simple gravity sum of the number of jobs over $40k, again taken from census data. Each job is divided by the square of how far it is from station B, to reflect that people who work further away from the station are less likely to use it. Once again, to prevent over-counting jobs located very close to the station, any job closer than ¼ mile from the station is considered ¼ mile away. The resulting weights are shown at right, in blue.

Step 5: Compute weighting by distance. Regardless of where people live and work, there are upper and lower limits to how far they will typically commute by rail. Extremely short trips are less likely because of the overhead of access and egress to and from the station at each end of the journey. Conversely, extremely long trips are less likely because of their sheer duration. As it turns out, the typical rush hour trip on Caltrain turns out to be about 25 miles, or 40 km.

For our purposes, the distance weighting is constructed by drawing a curve with a peak at 40 km. This distance weight starts off at zero for a trip distance of less than 7 km (reflecting no demand for such short trips), peaks at a distance of 40 km, and decays slowly thereafter. Converted to miles, it looks like the figure at left. The underlying math to draw this curve is a Rayleigh distribution with a peak at (d-7) = 33, where d is the trip distance in km.

Step 6: Combine the population, jobs and distance weights to obtain a ridership potential matrix. The ridership potential matrix R is a matrix of size N squared, where N is the number of stations. Each element R_AB of this matrix represents the "potential" ridership (in arbitrary relative units) that can be tapped into during the morning commute from origin A to destination B. This ridership potential matrix has an important property: it is independent of any timetable, and concisely describes the underlying demand that inherently exists out there--regardless of how or whether that demand is met by rail service. Each element R_AB is given by the product:

R_AB = Whome_A * Wwork_B * Wdistance_AB

Note that the matrix R is not symmetric, because the number of residents and jobs near each station differs. For example, far more people will want to commute to SF Transbay in the morning than from it, since the number of jobs within a half mile of that station is greater than all the jobs within a half mile of every other Caltrain station all the way to Gilroy combined.

Step 7: Compute the service quality matrix. The service quality matrix Q is again a matrix of size N squared, where N is the number of stations. Each element Q_AB of this matrix represents the quality of morning rush hour service from station A to station B, and is given by the following formula:

Q_AB = R_AB * V_AB

This combines R_AB, the timetable-independent ridership potential from origin A to destination B, with V_AB, the timetable-dependent effective speed from A to B. If you have a preferred AM origin and destination (as most commuters do), then you can compare your Q_AB for various timetables to see how any given timetable will meet your own specific needs.

Step 8: Extract overall service quality scores. The service quality metrics must be bench marked against some reference, so they are simply normalized against the most current timetable. That means today's timetable will score 100, by definition. By adding the elements of Q over all possible origin and destination pairs, we can quantify the degree of service improvement and compute a score for the entire timetable as well as a score for each individual station. The overall timetable service quality score is S = ΣQ / Sref, i.e. the sum of all the elements of Q divided by the corresponding sum for today's timetable.

An entire timetable can now be distilled to its essence, a single service quality score.

We are now empowered to compare various timetables and understand quantitatively the pros and cons of each. This method will tell you objectively whether timetable A provides better overall service than timetable B--and if you happened to disagree with the scoring outcome, then your argument would be with the scoring method and not any particular detail of this or that proposed timetable. Beyond the mathematical minutiae of the rather simple scoring method presented here, the larger point is that there needs to be a defined scoring process and a framework for stakeholders to discuss what makes a good timetable. This scoring process is absolutely essential for planning future blended service on the peninsula. Caltrain's approach so far has been to prescribe a certain skip-stop pattern (see Tables 7 and 8) and restrict all analysis to that particular pattern, seemingly without regard to overall service quality!

42 comments:

Martin06 October, 2012 22:51
Clem,
One factor that's not analyzed well, and not sure how to factor it are the shuttles.

In Palo Alto, half the train boards the marguerite shuttles.

My employer runs a shuttle to MV that's timed with 3/5 baby bullet trains in morning and evening. The shuttle ride is probably another 5 miles. What's interesting is that Sunnyvale and Lawrence stations are closer, but Mountain View has much better service. (It's also a faster straight shot down 237).

However, given the good service, most employers settled on MV as their main shuttle stop. (Yahoo, Apple, etc...) This is actually good, as that cooperation ensures large passenger volumes which are rewarded by fast and frequent service.

If all the employers were split evenly between Lawrence / Sunnyvale / MV, then trip time would suffer by servicing all three stations, or frequency would suffer due to express trains getting distributed across those 3 stations.
ReplyDelete
Replies
Alon07 October, 2012 01:59
You overlook how some of the assumptions you make themselves depend on service quality. If off-peak service is so bad nobody rides it, it will look as if everyone or almost everyone rides at the peak, making off-peak frequency appear less important. If frequency and lack of fare integration make sure everyone from Millbrae north uses BART instead of Caltrain, it will look as if few people make short trips, making express speed seem more important. If fares are too high, it will look as if low-income people don't travel at commuter rail distances. And so on.

In the longer term, development patterns depend on transit service quality. Compare development around subway and commuter rail stations. It's less relevant to the Peninsula because zoning laws and PAMPA NIMBYs ensure that high-intensity development around Caltrain stations will be sparse, but it's relevant to a service in an area with more rational zoning laws.
ReplyDelete
Replies
Martin07 October, 2012 23:08
Here's one question. Since Baby Bullet service come into place, the ridership to Cal Ave went down, while at Palo Alto went up way up. Many riders probably shifted from one stop to the other. Some riders might even need a shuttle or a bike where a walk would've sufficed. How do you evaluate if that's a good thing? Obviously, more PA passengers means more service (see extra PA service added on Oct 1st), so that's all good there.

But is that a good thing overall? When Broadway station closed, some people switched to cars, but others probably went to Millbrae and Burlingame. When Atherton closed, rider went to Menlo Park and RWC. Closing those stations, is obviously good overall since end-to-end time speeds up for everyone.

Clem, can your formula rate stations like that? For example, would overall ridership increase if South SF station were closed, and riders went (or were bused to) either Bayshore or San Bruno? Or would it be better to increase ridership to that station from 1/hour to 2/hour? Could we run such analysis on other staions?
ReplyDelete
Replies
Caelestor08 October, 2012 09:47
One important factor that this methodology doesn't account for is the fare zone system in place, which has probably diverted ridership from stations near the edge of one zone to adjacent ones at the edge of another zone. For example, Atherton has lost all its riders to RWC, while Sunnyvale will continue to see ridership increases at the detriment of Lawrence if round trips are $4 cheaper. This phenomenon affects the home_weight variable in the service quality metric.

I'll post some more thoughts later.

ReplyDelete
Replies
Richard Mlynarik09 October, 2012 12:48
The "push model" is also 100% based around driving to the origin station.

It simply doesn't work for destination ("work", in most cases) -- except for the minority of people who bring bicycles along.(a percentage which will have to decrease over time, for space reasons.)

Those many trains that used to express through Palo Alto until a week ago? THey did it because the "push model" said that PA parking lots were full, therefore everything else (Stanford, Marguerite, ...) can be ignored?

That one train PER HOUR peak at California Avenue? Just ignore the Stanford Business Park and its tens of thousands of employees, several hundred of whom formerly used the California Avenue stop.

Drive-to-station home-end commuters can and do drive a few extra miles based upon parking availability and parking price and upon train schedule.
For the entire rest of the potential market? A very poor service planning fit, indeed.

As for Caltrain's existing zone system and the like: yes, that skews existing ridership. (Existing ridership is skewed far more by wretched frequency, crazy skip-stop patterns, unreliability, and lack of connectivity.) But this isn't about exact modelling of Caltrain's present ridership: it's about meeting latent demand.

As for the argument that the census-based model neglects existing transit connections at stations: first, transit access is (sadly) pretty much negligible except at the SF terminal and Palo Alto. Marguerite and the forced transfers to Muni, and that's it to a first approximation. SJ and Millbrae have a few dribs and drabs. along with the employer shuttles at one trip end only. There really is no "network" and there are no "connections" that anybody with a choice endure.

But even aside from that, in a remotely rational world you'd expect transit availability to track housing and employment density (I mean, nobody would as insane to build a light rail line that runs slower than a bus and run where there are no passengers, right? RIGHT?), so the crude-but-useful demographic heuristics than Clem is using (and using transparently) are actually good proxies, and especially good proxies for where service ISN'T being provided but SHOULD be -- Caltrain as well as connecting transit service.

Yes, there are a few special cases -- university-driven "over-use"; geography like SF's Potrero Hill suppressing the simple distance-from-station counts -- but on the whole I think the data are hard to contest and the simple-but-not-misleading model built around them is interesting, not stupid, transparent, and worthy of attention.

If you want to add special cases, I'd throw in a two minute penalty for every northbound train at California Avenue thanks to Caltrain's knowingly stupidly misplaced tunnel. And add an extra minute or two minute southbound at SF for the new mandatory line-up-and-be-counted-citizens non-POP ticket check queue. And add an extra two minutes at Millbrae for the slow and lengthy escalator ride to the stars and back. And ... But let's get real, and keep things straightforward and avoid special pleading and special cases and try to achieve some idea of The Big Picture.
ReplyDelete
Replies
Jeff Carter27 October, 2012 13:58
Interesting discussion… I have never been a fan of so-called memory/clock face schedules if they are used to the detriment of regular customers. Is there any evidence that such schedules actually increase ridership?
As opposed to schedules that reflect when people start and end work?

I have had this discussion with a couple of my Caltrain advocate buddies some years ago. They would argue that the 3:45 pm train should leave SF at 4:00 pm instead, making it less useful to customers that get off work at 3:30 pm in San Francisco. Or the first train should leave San Jose at 5:00 am instead of 4:50 am to ‘increase’ ridership---‘memory schedule.’ However, that change would make the first train completely useless for those of us who start work in San Francisco at 6:30 am, therefore causing a loss of ridership.

They point out that with the memory schedule; you don’t need to have a timetable… Well, that argument was used by BART---Trains will run so frequently that you don’t need to have a schedule. For years BART did not publish schedules, they just got by with the time/fares matrix. But after years of customer requests, BART final started publishing timetables, customers want to know when a train would arrive at their origin station and arrive at their destination station. There was also this ‘condensed’ timetable thing… i. e. showing the bus at each time point… “Then every 15 minutes at the same times each hour until…” Samtrans used to have this type of timetable for the mainline/El Camino routes (5L, 5M, 7B, 7F) back in the days, but customers did not like these kinds of timetables, so Samtrans eventually started publishing schedules that show every bus at each time point throughout the day.

Then there are those who defend the current Caltrain schedule, citing that they are matching the service to the demand. Well the so-called ‘demand’ is actually reflective upon the schedule. Stations with shitty service will have shitty ridership; stations with good service will have good ridership. The illogical zone boundaries/system does contribute to some stations poorer ridership. This system grossly penalizes short trips. The primary reason for the current zone boundaries/lengths was to make the Caltrain fare from Millbrae to SF comparable to the BART fare from Millbrae to downtown SF. This system needs to be changed, the sooner the better. Either shorter zones or point to point, although in some cases a couple stations may be better grouped together, i. e. Palo Alto/Stanford/California; or San Francisco/22nd. Short trips should NOT be forced to use the bus, just because someone in Caltrain management or one of my fellow advocate buddies thinks they should… Which was another argument that I had with them, the idea was that Caltrain should close 22nd street so the train would be ‘faster’ and increase ridership. Those using 22nd can use the bus/shuttle get to Caltrain at 4th/King. This scenario is more likely to kill off ridership than to increase ridership, some people have no idea of the meaning of convenience.

The idea should be for Caltrain to be frequent enough that customers don’t have to think about or build their life around the schedules. The basic service should be about every 15-20 minutes from 4:00 am till at least midnight. Peak hours would have additional express/bullet service.

Infrequent service give people the impression that Caltrain is ‘slow’ compared to BART is ‘fast’… primarily due to the very frequent BART service.

Have you tested BART like service (every 15-20 minutes, 4:00 am till midnight) in your calculator?
ReplyDelete
Replies
Anonymous28 October, 2012 19:26
In Japan the urban private railways generally run a show up and go service with 2~3 minute headways schedule during the morning and evening peaks, and a "pattern" schedule in the off-peak between 10am and 4:00pm. The off peak service is typically a local, semi-express, and express pattern repeating every 20 minutes or so. Casual and occasional users find the system easy to understand, and regular commuters already know their train times and timepoints at every station on their route as the railways publish their complete schedules (often available as paper timetables at bookstores, in addition to on the net).
ReplyDelete
Replies
Adina14 December, 2012 21:36
Another way to use the tool is projected population/jobs. San Antonio has an atrocious schedule today. Mountain View is looking to upzone, the area is attracting development, and there are lively debates about whether and how much to improve bike/ped access in the area including to the train. The question about PA/Cal Ave/San Antonio schedule is partly a land use question.

Caltrain is not predominantly a park-n-ride system: as of 2010 "39 percent of the system’s riders drive to the station or are dropped off; 27 percent walk; 22 percent take transit; and 8 percent bike." It's problematic to assume that riders will compensate for station/schedule changes by driving further when 60% aren't driving to the station.
http://www.caltrain.com/about/News_Archive/Caltrain_Seeks_Input_on_New_Access_Policy.html
ReplyDelete
Replies

Add comment

Caltrain HSR Compatibility Blog

06 October 2012

Formulation of a Service Quality Metric

42 comments:

Recent Comments

Corridor To Do List

Focus On...

Search This Blog

Blog Archive

Reference Desk

Connections

About This Blog

Caltrain HSR Compatibility Blog

06 October 2012

Formulation of a Service Quality Metric

42 comments:

Recent Comments

Corridor To Do List

Focus On...

Search This Blog

Blog Archive

Reference Desk

Connections

Subscribe To

About This Blog