Mystics & Statistics

TDI Friday Read: How Many Troops Are Needed To Defeat An Insurgency?

A paratrooper from the French Foreign Legion (1er REP) with a captured fellagha during the Algerian War (1954-1962). [Via Pinterest]

Today’s edition of TDI Friday Read is a compilation of posts addressing the question of manpower and counterinsurgency. The first four posts summarize research on the question undertaken during the first decade of the 21st century, while the Afghan and Iraqi insurgencies were in full bloom. Despite different research questions and analytical methodologies, each of the studies concluded that there is a relationship between counterinsurgent manpower and counterinsurgency outcomes.

The fifth post addresses the U.S. Army’s lack of a formal methodology for calculating manpower requirements for counterinsurgencies and contingency operations.

Force Ratios and Counterinsurgency

Force Ratios and Counterinsurgency II

Force Ratios and Counterinsurgency III

Force Ratios and Counterinsurgency IV

https://dupuyinstitute.org/2016/06/29/has-the-army-given-up-on-counterinsurgency-research-again/

Captured Records: WWII

At the end of World War I, the United States made sure they had access to the German military records in their treaty and sent a research team over there to review and copy the pertinent records. At the end of World War II, we just took everything. The allies gathered together all the material, in part because of concerns over war crimes, and eventually the entire collection was shipped back to the United States.

In the 1960s, the United States decided to repatriate the records back to Germany. Before they shipped them out they decided to copy the entire collection of German World War II records and place them on microfilm. This was a massive effort done by government contractors that took several years. Like all good government contracts, towards the end, this one was behind schedule, so they were having to cut corners by choosing not to copy things that they felt were not particularly relevant. So, the originals records are now in the archives in Freiburg, Germany (a nice little town about as far away from the old East German border as you can get). There are copies on microfilm of most of the German records, sometimes in disorder, in the U.S. Archives II in College Park, Maryland (near Washington DC). The British also have microfilm copies of portions of the German records collection that they captured over at the Public Records Office in Kew (London). The UK collection is a fraction the size of the U.S. microfilm collection, and as far as I know has nothing additional in it.

But, there were some records that were not copied by the U.S., but it is not much. For example, for the Kursk Data Base, I did the German research from the U.S. record collection. There were 17 German divisions in the offensive in the south in July 1943. I made a detailed listing of the records I have reviewed and sent that list to Dr. Arthur Volz over in Germany. He then went to Frieburg and tried to locate additional material on strength and losses from those files. About the only additional material he located was the panzer regiment files from the 11th Panzer Division, which were either not in the U.S. archives or I overlooked when I did my research. That was it. Overall, the original copying effort was pretty exhaustive.

There was one major gap for a long time. For a couple of decades, many of the original German situation maps were in the U.S., but no longer accessible. There were supposed to be copied and sent to Germany, but there was a budget issue. Meanwhile one researcher was handling them so poorly that they canceled access so as to protect them. They have finally copied them and sent the originals back to Germany.

New WWII German Maps At The National Archives

There are also no real Luftwaffe files. Most of the Luftwaffe files were placed on a train and when the order came down from Hitler to destroy everything….these weenies actually obeyed the order and burned all their records. There are also major gaps in the German records after July 1944. Every six months, the German army units wrapped up their records and sent them back to their central archives. Because the war ended in May 1945, many of the records for July-December 1944 never made it back to be filed. Same for the 1945 records. This is why the QJM (Quantified Judgment Model) was originally developed from Italian Campaign Data from 1943 through June 1944.

Anyhow, this is an extended discussion of captured records originally inspired by this post and started with the discussions below.

The Sad Story Of The Captured Iraqi DESERT STORM Documents

Captured Records: World War I

Survey of German WWI Records

 

Will Tax Reform Throttle A U.S. Defense Budget Increase?

John Conger recently reported in Defense One that the tax reform initiative championed by the Trump administration and Republican congressional leaders may torpedo an increase in the U.S. defense budget for 2018. Both the House and Senate have passed authorizations approving the Trump administration’s budget request for $574.5 billion in defense spending, which is $52 billion higher than the limit established by the Budget Control Act (BCA). However, the House and Senate also recently passed a concurrent 2018 budget resolution to facilitate passage of a tax reform bill that caps the defense budget at $522 billion as mandated by the BCA.

The House and Senate armed services committees continue to hammer out the terms of the 2018 defense authorization, which includes increases in troop strength and pay. These priorities could crowd out other spending requested by the services to meet strategic and modernization requirements if the budget remains capped. Congress also continues to resist the call by Secretary of Defense James Mattis to close unneeded bases and facilities, which could free spending for other needs. There is also little interest in reforming Defense Department business practices that allegedly waste $125 billion annually.

Congressional Republicans and Democrats were already headed toward a showdown over 2018 BCA limits on defense spending. Even before the tax reform push, several legislators predicted yet another year-long continuing resolution limiting government spending to the previous year’s levels. A bipartisan consensus existed among some armed services committee members that this would constitute “borderline legislative malpractice, particularly for the Department of Defense.”

Despite the ambitious timeline set by President Trump to pass a tax reform bill, the chances of a continuing resolution remain high. It also seems likely that any agreement to increase defense spending will be through the Overseas Contingency Operations budget, which is not subject to the BCA. Many in Congress agree with Democratic Representative Adam Smith that resorting to this approach is “a fiscal sleight of hand [that] would be bad governance and ‘hypocritical.’”

Are tax reform and increased defense spending incompatible? Stay tuned.

The CRS Casualty Estimates

Let’s just outline the specifics of the casualty estimates for a war with North Korea in the latest Congressional Research Service (CRS) report dated 27 October 2017: https://fas.org/sgp/crs/nuke/R44994.pdf

On page 18:

Even if the DPRK uses only its conventional munitions, estimates range from between 30,000 and 300,000 dead in the first days of fighting, given that DPRK artillery is thought to be capable of firing 10,000 rounds per minute at Seoul. One observer states

Estimates are that hundreds of thousands of South Koreans would die in the first few hours of combat–from artillery, from rockets, from short range missiles–and if this war would escalate to the nuclear level, then you are looking at tens of millions of casualties and the destruction of the eleventh largest economy in the world.

It does not appear that CRS has done any independent analysis of this issues. Its sources in the footnotes are articles from Reuters, New York Times, CNN, NAPSNet Special Reports and GlobalSecurity.

And on page 3:

Should the DPRK use the nuclear, chemical or biological weapons in its arsenal, according to some estimates casualty figures could number in the millions.

 

Casualty Estimates for a War with North Korea

There are a few casualty estimates out there of the cost of a war with North Korea. A couple of these casualty estimates are summarized in this article: https://www.yahoo.com/news/u-must-invade-north-korea-091003273.html

They are:

1. As many as 2.1 million could die if nuclear detonations occurred over Seoul and Tokyo (source: website 38 North, October 2017).

2. As many as 300,000 could die in the first few days of a conflict between North Korea and the U.S. even without the use of nuclear weapons (source: Congressional Research Service, 27 October 2017: https://fas.org/sgp/crs/nuke/R44994.pdf)

The Dupuy Institute has not done any casualty estimates or analysis of a war with North Korea, nor are we planning to at this juncture. We have done a few casualty estimates in the past:

Predictions

 

 

Assessing the TNDA 1990-91 Gulf War Forecast

Assessing the 1990-1991 Gulf War Forecasts

Forecasting U.S. Casualties in Bosnia

https://dupuyinstitute.org/2016/06/27/forecasting-the-iraqi-insurgency/

TDI Friday Read: Afghanistan

[SIGAR, Quarterly Report to Congress, 30 October 2017, p. 107]

While it is too soon to tell if the Trump Administration’s revised strategy in Afghanistan will make a difference, the recent report by the Special Inspector General for Afghanistan Reconstruction (SIGAR) to Congress documents the continued slow erosion of security in that country. Today’s edition of TDI Friday Read offers a selection of recent posts addressing some of the problems facing the U.S. counterinsurgent and stabilization missions there.

Afghanistan

Meanwhile, In Afghanistan…

We probably need to keep talking about Afghanistan

What will be our plans for Afghanistan?

Stalemate in Afghanistan

Troop Increase in Afghanistan?

Sending More Troops to Afghanistan

Mattis on Afghanistan

Deployed Troop Counts

Disappearing Statistics

 

 

Disappearing Statistics

There was a time during the Iraq insurgency when statistics on the war were readily available. As a small independent contractor, we were getting the daily feed of incidents, casualties and other such material during the Iraq War. It was one of the daily intelligence reports for Iraq. We had simply emailed someone in the field and were put on their distribution list, even though we had no presence in Iraq and no official position. This was public information so it was not a problem….until the second half of 2005…when suddenly the war was not going very well…then someone decided to restrict distribution. We received daily intelligence reports from 4 September 2004. They ended on 25 August 2005. There is more to this story, but maybe later.

This article was brought to my attention today: https://www.militarytimes.com/flashpoints/2017/10/30/report-us-officials-classify-crucial-metrics-on-afghan-casualties-readiness/

A few highlights:

  1. From January 1 to May 8 Afghan forces sustained 2,531 killed in action and 4,238 wounded (a 1.67-to-1 wounded-to-killed ratio, which seems very low).

  2. The Afghan armed forces control 56.8% of the 407 districts, a one percentage point drop over the last six months.

  3. The Afghan government controls 63.7% percent of the population.

  4. Some of these statistics will now be classified.

 

One of our older posts on wounded-to-killed ratios. I have an entire chapter on the subject in War by Numbers.

Wounded-To-Killed Ratios

The Historical Combat Effectiveness of Lighter-Weight Armored Forces

A Stryker Infantry Carrier Vehicle-Dragoon fires 30 mm rounds during a live-fire demonstration at Aberdeen Proving Ground, Md., Aug. 16, 2017. Soldiers with 2nd Cavalry Regiment spent six weeks at Aberdeen testing and training on the new Stryker vehicle and a remote Javelin system, which are expected to head to Germany early next year for additional user testing. (Photo Credit: Sean Kimmons)

In 2001, The Dupuy Institute conducted a study for the U.S. Army Center for Army Analysis (CAA) on the historical effectiveness of lighter-weight armored forces. At the time, the Army had developed a requirement for an Interim Armored Vehicle (IAV), lighter and more deployable than existing M1 Abrams Main Battle Tank and the M2 Bradley Infantry Fighting Vehicle, to form the backbone of the future “Objective Force.” This program would result in development of the Stryker Infantry Fighting Vehicle.

CAA initiated the TDI study at the request of Walter W. “Don” Hollis, then the Deputy Undersecretary of the Army for Operations Research (a position that was eliminated in 2006.) TDI completed and submitted “The Historical Combat Effectiveness of Lighter-Weight Armored Forces” to CAA in August 2001. It examined the effectiveness of light and medium-weight armored forces in six scenarios:

  • Conventional conflicts against an armor supported or armor heavy force.
  • Emergency insertions against an armor supported or armor heavy force.
  • Conventional conflict against a primarily infantry force (as one might encounter in sub-Saharan Africa).
  • Emergency insertion against a primarily infantry force.
  • A small to medium insurgency (includes an insurgency that develops during a peacekeeping operation).
  • A peacekeeping operation or similar Operation Other Than War (OOTW) that has some potential for violence.

The historical data the study drew upon came from 146 cases of small-scale contingency operations; U.S. involvement in Vietnam; German counterinsurgency operations in the Balkans, 1941-1945; the Philippines Campaign, 1941-42; the Normandy Campaign, 1944; the Korean War 1950-51; the Persian Gulf War, 1990-91; and U.S. and European experiences with light and medium-weight armor in World War II.

The major conclusions of the study were:

Small Scale Contingency Operations (SSCOs)

  1. Implications for the Interim Armored Vehicle (IAV) Family of Vehicles. It would appear that existing systems (M-2 and M-3 Bradley and M-113) can fulfill most requirements. Current plans to develop an advanced LAV-type vehicle may cover almost all other shortfalls. Mine protection is a design feature that should be emphasized.
  2. Implications for the Interim Brigade Combat Team (IBCT). The need for armor in SSCOs that are not conventional or closely conventional in nature is limited and rarely approaches the requirements of a brigade-size armored force.

Insurgencies

  1. Implications for the Interim Armored Vehicle (IAV) Family of Vehicles. It would appear that existing systems (M-2 and M-3 Bradley and M-113) can fulfill most requirements. The armor threat in insurgencies is very limited until the later stages if the conflict transitions to conventional war. In either case, mine protection is a design feature that may be critical.
  2. Implications for the Interim Brigade Combat Team (IBCT). It is the nature of insurgencies that rapid deployment of armor is not essential. The armor threat in insurgencies is very limited until the later stages if the conflict transitions to a conventional war and rarely approaches the requirements of a brigade-size armored force.

Conventional Warfare

Conventional Conflict Against An Armor Supported Or Armor Heavy Force

  1. Implications for the Interim Armored Vehicle (IAV) Family of Vehicles. It may be expected that opposing heavy armor in a conventional armor versus armor engagement could significantly overmatch the IAV. In this case the primary requirement would be for a weapon system that would allow the IAV to defeat the enemy armor before it could engage the IAV.
  2. Implications for the Interim Brigade Combat Team (IBCT). The IBCT could substitute as an armored cavalry force in such a scenario.

Conventional Conflict Against A Primarily Infantry Force

  1. Implications for the Interim Armored Vehicle (IAV) Family of Vehicles. This appears to be little different from those conclusions found for the use of armor in SSCOs and Insurgencies.
  2. Implications for the Interim Brigade Combat Team (IBCT). The lack of a major armor threat will make the presence of armor useful.

Emergency Insertion Against An Armor Supported Or Armor Heavy Force

  1. Implications for the Interim Armored Vehicle (IAV) Family of Vehicles. It appears that the IAV may be of great use in an emergency insertion. However, the caveat regarding the threat of being overmatched by conventional heavy armor mentioned above should not be ignored. In this case the primary requirement would be for a weapon system that would allow the IAV to defeat the enemy armor before it could engage the IAV.
  2. Implications for the Interim Brigade Combat Team (IBCT). Although the theoretical utility of the IBCT in this scenario may be great it should be noted that The Dupuy Institute was only able to find one comparable case of such a deployment which resulted in actual conflict in US military history in the last 60 years (Korea, 1950). In this case the effect of pushing forward light tanks into the face of heavier enemy tanks was marginal.

Emergency Insertion Against A Primarily Infantry Force

  1. Implications for the Interim Armored Vehicle (IAV) Family of Vehicles. The lack of a major armor threat in this scenario will make the presence of any armor useful. However, The Dupuy Institute was unable to identify the existence of any such cases in the historical record.
  2. Implications for the Interim Brigade Combat Team (IBCT). The lack of a major armor threat will make the presence of any armor useful. However, The Dupuy Institute was unable to identify the existence of any such cases in the historical record.

Other Conclusions

Wheeled Vehicles

  1. There is little historical evidence one way or the other establishing whether wheels or tracks are the preferable feature of AFVs.

Vehicle Design

  1. In SSCOs access to a large-caliber main gun was useful for demolishing obstacles and buildings. This capability is not unique and could be replaced by AT missiles armed CFVs, IFVs and APCs.
  2. Any new lighter tank-like vehicle should make its gun system the highest priority, armor secondary and mobility and maneuverability tertiary.
  3. Mine protection should be emphasized. Mines were a major threat to all types of armor in many scenarios. In many SSCOs it was the major cause of armored vehicle losses.
  4. The robust carrying capacity offered by an APC over a tank is an advantage during many SSCOs.

Terrain Issues

  1. The use of armor in urban fighting, even in SSCOs, is still limited. The threat to armor from other armor in urban terrain during SSCOs is almost nonexistent. Most urban warfare armor needs, where armor basically serves as a support weapon, can be met with light armor (CFVs, IFVs, and APCs).
  2. Vehicle weight is sometimes a limiting factor in less developed areas. In all cases where this was a problem, there was not a corresponding armor threat. As such, in almost all cases, the missions and tasks of a tank can be fulfilled with other light armor (CFVs, IFVs, or APCs).
  3. The primary terrain problem is rivers and flooded areas. It would appear that in difficult terrain, especially heavily forested terrain (areas with lots of rainfall, like jungles), a robust river crossing capability is required.

Operational Factors

  1. Emergency insertions and delaying actions sometimes appear to be a good way to lose lots of armor for limited gain. This tends to come about due to terrain problems, enemy infiltration and bypassing, and the general confusion prevalent in such operations. The Army should be careful not to piecemeal assets when inserting valuable armor resources into a ‘hot’ situation. In many cases holding back and massing the armor for defense or counter-attack may be the better option.
  2. Transportability limitations have not been a major factor in the past for determining whether lighter or heavier armor were sent into a SSCO or a combat environment.

Casualty Sensitivity

  1. In a SSCO or insurgency, in most cases the weight and armor of the AFVs is not critical. As such, one would not expect any significant changes in losses regardless of the type of AFV used (MBT, medium-weight armor, or light armor). However, the perception that US forces are not equipped with the best-protected vehicle may cause some domestic political problems. The US government is very casualty sensitive during SSCOs. Furthermore, the current US main battle tank particularly impressive, and may help provide some additional intimidation in SSCOs.
  2. In any emergency insertion scenario or conventional war scenario, the use of lighter armor could result in higher US casualties and lesser combat effectiveness. This will certainly cause some domestic political problems and may impact army morale. However by the same token, light infantry forces, unsupported by easily deployable armor could present a worse situation.

U.S. Army Solicits Proposals For Mobile Protected Firepower (MPF) Light Tank

The U.S. Army’s late and apparently lamented M551 Sheridan light tank. [U.S. Department of the Army/Wikipedia]

The U.S. Army recently announced that it will begin soliciting Requests for Proposal (RFP) in November to produce a new lightweight armored vehicle for its Mobile Protected Firepower (MPF) program. MPF is intended to field a company of vehicles for each Army Infantry Brigade Combat Team to provide them with “a long-range direct-fire capability for forcible entry and breaching operations.”

The Army also plans to field the new vehicle quickly. It is dispensing with the usual two-to-three year technology development phase, and will ask for delivery of the first sample vehicles by April 2018, one month after the RFP phase is scheduled to end. This will invariably favor proposals using existing off-the-shelf vehicle designs and “mature technology.”

The Army apparently will also accept RFPs with turret-mounted 105mm main guns, at least initially. According to previous MFP parameters, acceptable designs will eventually need to be able to accommodate 120mm guns.

I have observed in the past that the MPF is the result of the Army’s concerns that its light infantry may be deprived of direct fire support on anti-access/area denial (A2/AD) battlefields. Track-mounted, large caliber direct fire guns dedicated to infantry support are something of a doctrinal throwback to the assault guns of World War II, however.

There was a noted tendency during World War II to use anything on the battlefield that resembled a tank as a main battle tank, with unhappy results for the not-main battle tanks. As a consequence, assault guns, tank destroyers, and light tanks became evolutionary dead-ends in the development of post-World War II armored doctrine (the late M551 Sheridan, retired without replacement in 1996, notwithstanding). [For more on the historical background, see The Dupuy Institute, “The Historical Effectiveness of Lighter-Weight Armored Forces,” August 2001.]

The Army has been reluctant to refer to MPF as a light tank, but as David Dopp, the MPF Program Manager admitted, “I don’t want to say it’s a light tank, but it’s kind of like a light tank.” He went on to say that “It’s not going toe to toe with a tank…It’s for the infantry. It goes where the infantry goes — it breaks through bunkers, it works through targets that the infantry can’t get through.”

Major General David Bassett, program executive officer for the Army’s Ground Combat Systems concurred. It will be a tracked vehicle with substantial armor protection, Bassett said, “but certainly not what you’d see on a main battle tank.”

It will be interesting to see what the RFPs have to offer.

Previous TDI commentaries on the MPF Program:

https://dupuyinstitute.org/2016/10/19/back-to-the-future-the-mobile-protected-firepower-mpf-program/

https://dupuyinstitute.org/2017/03/21/u-s-army-moving-forward-with-mobile-protected-firepower-mpf-program/

Validating Trevor Dupuy’s Combat Models

[The article below is reprinted from Winter 2010 edition of The International TNDM Newsletter.]

A Summation of QJM/TNDM Validation Efforts

By Christopher A. Lawrence

There have been six or seven different validation tests conducted of the QJM (Quantified Judgment Model) and the TNDM (Tactical Numerical Deterministic Model). As the changes to these two models are evolutionary in nature but do not fundamentally change the nature of the models, the whole series of validation tests across both models is worth noting. To date, this is the only model we are aware of that has been through multiple validations. We are not aware of any DOD [Department of Defense] combat model that has undergone more than one validation effort. Most of the DOD combat models in use have not undergone any validation.

The Two Original Validations of the QJM

After its initial development using a 60-engagement WWII database, the QJM was tested in 1973 by application of its relationships and factors to a validation database of 21 World War II engagements in Northwest Europe in 1944 and 1945. The original model proved to be 95% accurate in explaining the outcomes of these additional engagements. Overall accuracy in predicting the results of the 81 engagements in the developmental and validation databases was 93%.[1]

During the same period the QJM was converted from a static model that only predicted success or failure to one capable of also predicting attrition and movement. This was accomplished by adding variables and modifying factor values. The original QJM structure was not changed in this process. The addition of movement and attrition as outputs allowed the model to be used dynamically in successive “snapshot” iterations of the same engagement.

From 1973 to 1979 the QJM’s formulae, procedures, and variable factor values were tested against the results of all of the 52 significant engagements of the 1967 and 1973 Arab-Israeli Wars (19 from the former, 33 from the latter). The QJM was able to replicate all of those engagements with an accuracy of more than 90%?[2]

In 1979 the improved QJM was revalidated by application to 66 engagements. These included 35 from the original 81 engagements (the “development database”), and 31 new engagements. The new engagements included five from World War II and 26 from the 1973 Middle East War. This new validation test considered four outputs: success/failure, movement rates, personnel casualties, and tank losses. The QJM predicted success/failure correctly for about 85% of the engagements. It predicted movement rates with an error of 15% and personnel attrition with an error of 40% or less. While the error rate for tank losses was about 80%, it was discovered that the model consistently underestimated tank losses because input data included all kinds of armored vehicles, but output data losses included only numbers of tanks.[3]

This completed the original validations efforts of the QJM. The data used for the validations, and parts of the results of the validation, were published, but no formal validation report was issued. The validation was conducted in-house by Colonel Dupuy’s organization, HERO [Historical Evaluation Research Organization]. The data used were mostly from division-level engagements, although they included some corps- and brigade-level actions. We count these as two separate validation efforts.

The Development of the TNDM and Desert Storm

In 1990 Col. Dupuy, with the collaborative assistance of Dr. James G. Taylor (author of Lanchester Models of Warfare [vol. 1] [vol. 2], published by the Operations Research Society of America, Arlington, Virginia, in 1983) introduced a significant modification: the representation of the passage of time in the model. Instead of resorting to successive “snapshots,” the introduction of Taylor’s differential equation technique permitted the representation of time as a continuous flow. While this new approach required substantial changes to the software, the relationship of the model to historical experience was unchanged.[4] This revision of the model also included the substitution of formulae for some of its tables so that there was a continuous flow of values across the individual points in the tables. It also included some adjustment to the values and tables in the QJM. Finally, it incorporated a revised OLI [Operational Lethality Index] calculation methodology for modem armor (mobile fighting machines) to take into account all the factors that influence modern tank warfare.[5] The model was reprogrammed in Turbo PASCAL (the original had been written in BASIC). The new model was called the TNDM (Tactical Numerical Deterministic Model).

Building on its foundation of historical validation and proven attrition methodology, in December 1990, HERO used the TNDM to predict the outcome of, and losses from, the impending Operation DESERT STORM.[6] It was the most accurate (lowest) public estimate of U.S. war casualties provided before the war. It differed from most other public estimates by an order of magnitude.

Also, in 1990, Trevor Dupuy published an abbreviated form of the TNDM in the book Attrition: Forecasting Battle Casualties and Equipment Losses in Modern War. A brief validation exercise using 12 battles from 1805 to 1973 was published in this book.[7] This version was used for creation of M-COAT[8] and was also separately tested by a student (Lieutenant Gozel) at the Naval Postgraduate School in 2000.[9] This version did not have the firepower scoring system, and as such neither M-COAT, Lieutenant Gozel’s test, nor Colonel Dupuy’s 12-battle validation included the OLI methodology that is in the primary version of the TNDM.

For counting purposes, I consider the Gulf War the third validation of the model. In the end, for any model, the proof is in the pudding. Can the model be used as a predictive tool or not? If not, then there is probably a fundamental flaw or two in the model. Still the validation of the TNDM was somewhat second-hand, in the sense that the closely-related previous model, the QJM, was validated in the 1970s to 200 World War II and 1967 and 1973 Arab-Israeli War battles, but the TNDM had not been. Clearly, something further needed to be done.

The Battalion-Level Validation of the TNDM

Under the guidance of Christopher A. Lawrence, The Dupuy Institute undertook a battalion-level validation of the TNDM in late 1996. This effort tested the model against 76 engagements from World War I, World War II, and the post-1945 world including Vietnam, the Arab-Israeli Wars, the Falklands War, Angola, Nicaragua, etc. This effort was thoroughly documented in The International TNDM Newsletter.[10] This effort was probably one of the more independent and better-documented validations of a casualty estimation methodology that has ever been conducted to date, in that:

  • The data was independently assembled (assembled for other purposes before the validation) by a number of different historians.
  • There were no calibration runs or adjustments made to the model before the test.
  • The data included a wide range of material from different conflicts and times (from 1918 to 1983).
  • The validation runs were conducted independently (Susan Rich conducted the validation runs, while Christopher A. Lawrence evaluated them).
  • The results of the validation were fully published.
  • The people conducting the validation were independent, in the sense that:

a) there was no contract, management, or agency requesting the validation;
b) none of the validators had previously been involved in designing the model, and had only very limited experience in using it; and
c) the original model designer was not able to oversee or influence the validation.[11]

The validation was not truly independent, as the model tested was a commercial product of The Dupuy Institute, and the person conducting the test was an employee of the Institute. On the other hand, this was an independent effort in the sense that the effort was employee-initiated and not requested or reviewed by the management of the Institute. Furthermore, the results were published.

The TNDM was also given a limited validation test back to its original WWII data around 1997 by Niklas Zetterling of the Swedish War College, who retested the model to about 15 or so Italian campaign engagements. This effort included a complete review of the historical data used for the validation back to their primarily sources, and details were published in The International TNDM Newsletter.[12]

There has been one other effort to correlate outputs from QJM/TNDM-inspired formulae to historical data using the Ardennes and Kursk campaign-level (i.e., division-level) databases.[13] This effort did not use the complete model, but only selective pieces of it, and achieved various degrees of “goodness of fit.” While the model is hypothetically designed for use from squad level to army group level, to date no validation has been attempted below battalion level, or above division level. At this time, the TNDM also needs to be revalidated back to its original WWII and Arab-Israeli War data, as it has evolved since the original validation effort.

The Corps- and Division-level Validations of the TNDM

Having now having done one extensive battalion-level validation of the model and published the results in our newsletters, Volume 1, issues 5 and 6, we were then presented an opportunity in 2006 to conduct two more validations of the model. These are discussed in depth in two articles of this issue of the newsletter.

These validations were again conducted using historical data, 24 days of corps-level combat and 25 cases of division-level combat drawn from the Battle of Kursk during 4-15 July 1943. It was conducted using an independently-researched data collection (although the research was conducted by The Dupuy Institute), using a different person to conduct the model runs (although that person was an employee of the Institute) and using another person to compile the results (also an employee of the Institute). To summarize the results of this validation (the historical figure is listed first followed by the predicted result):

There was one other effort that was done as part of work we did for the Army Medical Department (AMEDD). This is fully explained in our report Casualty Estimation Methodologies Study: The Interim Report dated 25 July 2005. In this case, we tested six different casualty estimation methodologies to 22 cases. These consisted of 12 division-level cases from the Italian Campaign (4 where the attack failed, 4 where the attacker advanced, and 4 Where the defender was penetrated) and 10 cases from the Battle of Kursk (2 cases Where the attack failed, 4 where the attacker advanced and 4 where the defender was penetrated). These 22 cases were randomly selected from our earlier 628 case version of the DLEDB (Division-level Engagement Database; it now has 752 cases). Again, the TNDM performed as well as or better than any of the other casualty estimation methodologies tested. As this validation effort was using the Italian engagements previously used for validation (although some had been revised due to additional research) and three of the Kursk engagements that were later used for our division-level validation, then it is debatable whether one would want to call this a seventh validation effort. Still, it was done as above with one person assembling the historical data and another person conducting the model runs. This effort was conducted a year before the corps and division-level validation conducted above and influenced it to the extent that we chose a higher CEV (Combat Effectiveness Value) for the later validation. A CEV of 2.5 was used for the Soviets for this test, vice the CEV of 3.0 that was used for the later tests.

Summation

The QJM has been validated at least twice. The TNDM has been tested or validated at least four times, once to an upcoming, imminent war, once to battalion-level data from 1918 to 1989, once to division-level data from 1943 and once to corps-level data from 1943. These last four validation efforts have been published and described in depth. The model continues, regardless of which validation is examined, to accurately predict outcomes and make reasonable predictions of advance rates, loss rates and armor loss rates. This is regardless of level of combat (battalion, division or corps), historic period (WWI, WWII or modem), the situation of the combats, or the nationalities involved (American, German, Soviet, Israeli, various Arab armies, etc.). As the QJM, the model was effectively validated to around 200 World War II and 1967 and 1973 Arab-Israeli War battles. As the TNDM, the model was validated to 125 corps-, division-, and battalion-level engagements from 1918 to 1989 and used as a predictive model for the 1991 Gulf War. This is the most extensive and systematic validation effort yet done for any combat model. The model has been tested and re-tested. It has been tested across multiple levels of combat and in a wide range of environments. It has been tested where human factors are lopsided, and where human factors are roughly equal. It has been independently spot-checked several times by others outside of the Institute. It is hard to say what more can be done to establish its validity and accuracy.

NOTES

[1] It is unclear what these percentages, quoted from Dupuy in the TNDM General Theoretical Description, specify. We suspect it is a measurement of the model’s ability to predict winner and loser. No validation report based on this effort was ever published. Also, the validation figures seem to reflect the results after any corrections made to the model based upon these tests. It does appear that the division-level validation was “incremental.” We do not know if the earlier validation tests were tested back to the earlier data, but we have reason to suspect not.

[2] The original QJM validation data was first published in the Combat Data Subscription Service Supplement, vol. 1, no. 3 (Dunn Loring VA: HERO, Summer 1975). (HERO Report #50) That effort used data from 1943 through 1973.

[3] HERO published its QJM validation database in The QJM Data Base (3 volumes) Fairfax VA: HERO, 1985 (HERO Report #100).

[4] The Dupuy Institute, The Tactical Numerical Deterministic Model (TNDM): A General and Theoretical Description, McLean VA: The Dupuy Institute, October 1994.

[5] This had the unfortunate effect of undervaluing WWII-era armor by about 75% relative to other WWII weapons when modeling WWII engagements. This left The Dupuy Institute with the compromise methodology of using the old OLI method for calculating armor (Mobile Fighting Machines) when doing WWII engagements and using the new OLI method for calculating armor when doing modem engagements

[6] Testimony of Col. T. N. Dupuy, USA, Ret, Before the House Armed Services Committee, 13 Dec 1990. The Dupuy Institute File I-30, “Iraqi Invasion of Kuwait.”

[7] Trevor N. Dupuy, Attrition: Forecasting Battle Casualties and Equipment Losses in Modern War (HERO Books, Fairfax, VA, 1990), 123-4.

[8] M-COAT is the Medical Course of Action Tool created by Major Bruce Shahbaz. It is a spreadsheet model based upon the elements of the TNDM provided in Dupuy’s Attrition (op. cit.) It used a scoring system derived from elsewhere in the U.S. Army. As such, it is a simplified form of the TNDM with a different weapon scoring system.

[9] See Gözel, Ramazan. “Fitting Firepower Score Models to the Battle of Kursk Data,” NPGS Thesis. Monterey CA: Naval Postgraduate School.

[10] Lawrence, Christopher A. “Validation of the TNDM at Battalion Level.” The International TNDM Newsletter, vol. 1, no. 2 (October 1996); Bongard, Dave “The 76 Battalion-Level Engagements.” The International TNDM Newsletter, vol. 1, no. 4 (February 1997); Lawrence, Christopher A. “The First Test of the TNDM Battalion-Level Validations: Predicting the Winner” and “The Second Test of the TNDM Battalion-Level Validations: Predicting Casualties,” The International TNDM Newsletter, vol. 1 no. 5 (April 1997); and Lawrence, Christopher A. “Use of Armor in the 76 Battalion-Level Engagements,” and “The Second Test of the Battalion-Level Validation: Predicting Casualties Final Scorecard.” The International TNDM Newsletter, vol. 1, no. 6 (June 1997).

[11] Trevor N. Dupuy passed away in July 1995, and the validation was conducted in 1996 and 1997.

[12] Zetterling, Niklas. “CEV Calculations in Italy, 1943,” The International TNDM Newsletter, vol. 1, no. 6. McLean VA: The Dupuy Institute, June 1997. See also Research Plan, The Dupuy Institute Report E-3, McLean VA: The Dupuy Institute, 7 Oct 1998.

[13] See Gözel, “Fitting Firepower Score Models to the Battle of Kursk Data.”