\documentclass{article} \usepackage{setspace} \usepackage{amssymb} \usepackage{amsmath} \usepackage{pdfpages} \usepackage{tikz} \usetikzlibrary{tikzmark,decorations.pathreplacing,calc} \usepackage{hyperref} \usepackage{multirow} \usepackage[nottoc]{tocbibind} \usepackage{caption} \usepackage{subcaption} \hangindent = -1in \usepackage[margin = 1in, includeheadfoot]{geometry} \usepackage{setspace} \doublespacing \begin{document} \thispagestyle{empty} { % set font to helvetica (arial) to make it 508-compliant \singlespacing \fontfamily{phv}\selectfont \begin{center} {\LARGE \textbf{An Examination of Gravity Relationships at the Subnational Level}} \\ \vspace{0.25in} {\Large Paul Phillips} \\ \vspace{1.5in} {\large ECONOMICS WORKING PAPER SERIES}\\ Working Paper 2024--05--A \\ \vspace{0.5in} U.S. INTERNATIONAL TRADE COMMISSION \\ 500 E Street SW \\ Washington, DC 20436 \\ \vspace{0.5in} May 2024 \\ \end{center} \vfill \noindent Office of Economics working papers are the result of ongoing professional research of USITC Staff and are solely meant to represent the opinions and professional research of individual authors. These papers are not meant to represent in any way the views of the U.S. International Trade Commission or any of its individual Commissioners. \newpage \thispagestyle{empty} % remove headers, footers, and page numbers from cover page \begin{flushleft} An Examination of Gravity Relationships at the Subnational Level \\ Paul Phillips \\ Economics Working Paper 2024--05--A\\ April 2024 \\~\\ \end{flushleft} \vfill \begin{abstract} \noindent This paper uses data on shipments between U.S. regions to estimate a subnational version of the canonical international gravity equation. Using a CES utility framework, I estimate gravity regressions using a data set comprised of 132 subnational regions and then extract trade cost values for each origin--destination pair, breaking down results by sector and mode of transportation. All forty--two sectors show a statistically significant negative relationship between distance and trade, with the elasticity of trade with respect to distance being higher for raw materials than for manufactured goods. Among modes of transportation, distance more negatively affects pipeline shipments, and the effect of distance on air shipments is not statistically significant. With other factors being equal, shipping goods between regions costs around four times as much on median as shipping goods within a region, although this relationship varies widely depending on the mode of transportation or the sector. \end{abstract} \vfill \begin{flushleft} Paul Phillips \\ Research Division, Office of Economics\\ U.S. International Trade Commission\\ Paul.Phillips@usitc.gov \\ \end{flushleft} } % end of helvetica (arial) font \clearpage \newpage \section{Introduction} \indent Since Tinbergen (1962) debuted the first gravity model, gravity estimation has been a prevalent field within international trade economics. A typical gravity estimation procedure involves the regression of bilateral trade between two countries on the distance between those two countries as well as shared characteristics including a shared border, the presence of a common language, the presence of a common currency, or colonial ties. Researchers use gravity estimations to inform their understanding of the factors influencing trade between two nations, or to calibrate the iceberg transportation costs that they can then input into a structural model. \\ \indent However, economists have not yet, to this author's knowledge, applied gravity regression estimation to analyze the factors predicting trade between granular regions \textit{within} the United States. The majority of papers analyzing subnational trade focus on the distributional effects of international trade policies throughout the country, and they do so either by assuming that import exposure rates are the same across the country or by using data at the level of states or aggregated Census areas,\footnote{The Census Bureau uses the term `regions' when delineating subdivisions of the United States. The Freight Analysis Framework, my main data source for this paper, also describes its U.S. subdivisions as regions. To avoid confusion, I will use the term `areas' when referring to Census subdivisions.} where data is more reliable and more readily available than at the level of counties or metropolitan areas. Another strand of literature has used subnational trade statistics and GIS (Global Information System) data on transportation networks to map out trade costs within the United States. In this paper, I perform the more basic task of examining how the characteristics of a pair of U.S. regions influence the amount of trade conducted between these two regions. The results of this analysis provide insight into how gravity variables affect trade at the subnational level, as well as providing further guidance on how to calibrate trade costs within the United States in a structural model. \\ \indent I obtain subnational shipment data from the Freight Analysis Framework (FAF), which reports the value and weight of shipments between 132 subnational regions. I use a simplified gravity estimation method that allows me to divide data into forty--two sectors and five modes of transportation. Explanatory variables within the gravity equation include distance and binary indicators of whether the two regions are in the same state or broader Census area, as well as other geographic and demographic factors that the regions may share in common. I then apply these gravity equation results to a standard CES model to obtain estimates of trade costs\footnote{Throughout this paper, I use the terms `trade costs' and `transportation costs' synonymously.} between different regions of the country. \\ \indent Gravity equation results reveal that trade elasticities with respect to distance tend to be lower for manufactured goods than they are for raw materials, while distance elasticities are lower for goods transported across rail and air than for goods transported by truck, pipeline, or water. Gravity equation results do not differ significantly between 2018 and the following years, suggesting that the pandemic and trade wars with China did not have a strong effect on the gravity mechanisms behind subnational trade. The median trade cost between two subnational regions is around five, meaning that shipping goods between two regions of the country costs around 400\% more than shipping goods within the same region. Median trade costs in the agricultural sector are higher than median trade costs in manufacturing sectors and median trade costs in air, water, and trade costs in rail transportation are lower than those in truck and pipeline transportation. \\ \indent Section 2 reviews some of the literature on economic geography and subnational trade. Section 3 goes through the data, while Section 4 introduces the gravity equation methodology. Section 5 presents the gravity equation results, and Section 6 presents estimates of transportation costs. Section 7 concludes. %explain different sections \section{Literature Review} The typical gravity estimation procedure entails regressing trade between two countries on distance and a bevy of bilateral country characteristics, including shared border, common language, common currency, colonial ties, or the presence of a regional trade agreement between the two countries. Fixed effects control for factors specific to one country, and gravity regressions either employ a fixed effects estimation or Pseudo--Poisson Maximum Likelihood (PPML) framework. The U.S. International Trade Commission provides a database of country--level gravity variables across time (Gurevich and Herman, 2018). Recent papers in the academic literature that perform gravity estimation include Anderson, Larch, and Yotov (2022); Yotov, Larch, and Heid (2015); Agnosteva, Anderson, and Yotov (2014); Olivero and Yotov (2012); Lin and Sim (2012); Helpman, Melitz and Rubinstein (2008). To the author's knowledge, the gravity literature has not yet considered estimating the relationship between gravity factors and trade at the subnational level. \\ \indent Papers that use subnational trade data will often do so in order to estimate the regional effects of a national trade policy. These papers often assume that import exposure is the same regardless of location within the country. Reduced--form papers such as Hakobyan and McLaren (2016); Acemoglu, Autor, Dorn, Hanson, and Price (2016); and Autor, Dorn and Hanson (2013) determine local labor market exposure to trade competition using localized employment data and national import data. The model--based analyses in Caliendo and Parro (2021) and Caliendo, Dvorkin and Parro (2019) contain trade costs at the local level, but calculate subnational import shares based on national trade data from the World Input--Output Database and regional variation in employment. Other papers do not make such a strong assumption about import exposure, but due to data limitations they perform their analysis at the state or Census region level rather than any more disaggregated subnational areas. Riker (2022) and Feenstra and Hong (2023, working paper) use subnational trade data to determine how changes in trade policy will affect different states, while Riker (2019) performs a similar analysis for U.S. census regions. Riker (2022) uses subnational shipment data from the Commodity Flow Survey, while Feenstra and Hong use the FAF. Analysis done at the state or regional level can help increase data reliability but nonetheless obscures potentially large regional differences within states. The gravity analysis discussed here employs data for 132 sub--state regions, allowing for insights into how these more granular factors can influence trade. \\ \indent Another strand of the economics literature uses granular subnational data to estimate a network of trade costs across the country. Recent papers do so in order to model the welfare effects of a specific transportation policy. These papers include Fuchs and Wong (2023) for port congestion, Allen and Arkolakis (2022) for road congestion, Jaworski, Kitchens, and Nigai (2023) for the Interstate Highway System, and Donaldson and Hornbeck (2016) for the historical development of U.S. railroads. Allen and Arkolakis (2014) use a general equilibrium model to estimate the distribution of trade costs, productivities and amenities in the United States, while also calculating the welfare effects of constructing the Interstate Highway System. Meanwhile, Agnosteva, Anderson and Yotov (2014) and Anderson and van Wincoop (2003) use subnational data to estimate different transportation costs, such as the transportation cost imposed by distance and the trade cost associated with crossing a state or international border. Agnosteva et al. focus on Canadian provinces, while Anderson and van Wincoop's model uses data from Canadian provinces and U.S. states. This literature often focuses on one mode of transportation, while not distinguishing between sectors. In comparison, my use of a more simplified gravity framework allows me to examine forty--two disaggregated sectors and five modes of transportation simultaneously, allowing for insights into how gravity results may differ at the sectoral level. \\ \indent By using a CES--driven gravity framework with detailed subnational trade data, this paper provides clear insight into how geographic and demographic variables relate with trade at the subnational level. The set--up of the paper also allows for insights into how gravity results may differ between sectors or between different modes of transportation. \section{Data} I use data from the Freight Analysis Framework, which provides figures on inflows and outflows between 132 subnational regions. In this paper, I focus on how geographic or demographic characteristics affect a shipment that passes between two subnational regions, regardless of whether this shipment was necessarily produced in the first region or consumed in the second. Although the FAF breaks subnational trade flows down into shipments traded exclusively within the U.S. and domestic components of international shipments, I use information on shipments sent from one region within the U.S. to another, aggregated across all initial origins or ultimate destinations.\footnote{The FAF does not distinguish between shipments that terminate at their recorded destination and transshipments whose recorded destination is a temporary warehousing. While this would complicate the task of estimating subnational import exposure rates, it has no bearing on the gravity analysis contained within this paper.} \\ \indent The regions represent both metropolitan areas and non--metropolitan areas, although not all U.S. metropolitan areas are represented in the dataset. The FAF either represents each state in its entirety, as is the case with less--populated states such as Alaska and Idaho, or divides states into one or more metropolitan areas and the `rest of the state', which includes all counties not represented by one of the aforementioned metropolitan areas. For example, the FAF divides Georgia into Atlanta, Savannah, and `Rest of Georgia.' \\ \indent The FAF data includes forty--two SCTG2 sectors and eight modes of transportation. The eight modes of transportation are truck, rail, water, air (including routes with both truck and air components), multiple modes and mail, pipeline, other and unknown, and no domestic mode. The `multiple modes and mail' category includes all trade involving intermodal switching other than truck--air, but does not include more detailed information on the different modes involved or when the switching occurs. `Other and unknown' and `no domestic mode' are limited in their number of observations, and any results in these areas would be difficult to interpret. I therefore focus on the five major transport modes: truck, rail, water, air, and pipeline. \subsection{Descriptive Statistics} Subnational trade flows encompass 132 regions, forty--two sectors, and five modes of transportation, for a total of around six million observations. Any kind of comprehensive analysis would be impossible, but this section highlights some of the general trends observed in U.S. subnational trade in 2018. \\ \indent As shown in Figure 1 on the following page, the overwhelming majority of goods within the United States are shipped by truck, with truck travel accounting for almost three--quarters of subnational trade by value. Trade over water, rail, and air routes all account for less than five percent of total trade, with goods shipped by pipeline covering about six percent of subnational trade values. \begin{figure}[h!] \centering \includegraphics[width = .5\linewidth]{modepiechart.png} %ALT TEXT: This pie chart shows the percentage of the value of all subnational trade shipped using each mode of transportation. Major modes are truck, rail, water, air, and pipeline. \caption{Breakdown of subnational trade by type of transportation, NAICS} \end{figure} \\ \begin{table}[h!] \centering \begin{tabular}{|c|c|c|c|c|c|c|} \hline \multicolumn{2}{|c|}{Statistic} & Truck & Rail & Water & Air & Pipeline \\ \hline \multicolumn{2}{|c|}{Proportion of agricultural trade} & .898 & .0389 & .0160 & $2.36\times10^{-3}$ & 0 \\ \hline \multicolumn{2}{|c|}{Proportion of trade in energy} & .464 & .0352 & .0690 & $1.14\times10^{-5}$ & .417 \\ \hline \multicolumn{2}{|c|}{Proportion of trade in motor vehicles} & .764 & .0862 & .00464 & $.00599$ & 0 \\ \hline \end{tabular} \caption{Breakdown of subnational trade by mode of transportation for selected sectors} %ALT TEXT: For certain important sectors, I display the fraction of subnational trade in that sector shipped using truck, water, rail, air, or pipeline. The objective is to see how the mode-level breakdown differs by sector. \label{table:1} \end{table} \\ Table 1 shows that the breakdown of subnational shipments by type of transportation varies by sector. For agricultural goods, which include FAF sectors 1--9, almost ninety percent of all shipments are by truck, while rail and air shipments are nearly nonexistent. Energy goods, which include FAF sectors 15--19, are transported by pipeline almost as often as by truck, and the fraction of goods shipped by water is also higher for energy goods than it is overall. Around three--quarters of the value of motor vehicles trade is by truck, with rail shipments comprising 8.6 percent of motor vehicles trade compared with three percent of trade generally. \begin{table}[h!] \centering \begin{tabular}{|c|c|c|c|c|c|} \hline & Truck & Rail & Water & Air & Pipeline \\ \hline Pairs with zero trade & .468\% & 11.9\% & 36.5\% & 9.53\% & 11.2\% \\ \hline Largest origin (volume) & Los Angeles & Detroit & New Orleans & Los Angeles & Rest of TX \\ \hline Largest destination (volume) & New York, NY & Detroit & New Orleans & Los Angeles & Houston, TX \\ \hline Largest origin (fraction) & Rest of MD & ND & AK & AK & ND \\ \hline Largest destination (fraction) & Fresno, CA & Portland, OR & New Orleans & AK & Beaumont, TX \\ \hline \end{tabular} \caption{Breakdown of subnational trade by mode of transportation for regions} %ALT TEXT: More info on the different modes of transportation. Now I am interested in 1) the fraction of origin/destination pairs that do not trade using that mode of transportation, 2) the identity of the region for each mode with the largest volume of trade passing through and 3) for each mode, which region has the highest fraction of trade passing through using that mode. \label{table:1} \end{table} \\ The vast majority of U.S. regions send shipments to each other. Only 144 regional pairs out of the 17424 total pairs do not trade with each other in any capacity. Of the regional pairs that trade with each other, fewer than one percent do not conduct trade using trucks. However, the other modes of transportation are more specialized. 11.9\% of regional pairs do not send goods to each other by railroad, 36.5\% do not trade over water, 9.53\% do not trade via airplane and 11.2\% do not send goods via pipeline. \\ \indent Due to its large population and proximity to China and Mexico, Los Angeles, CA is the largest origin region by volume for trade transported by truck and air. Los Angeles is also the highest destination region for air trade, but New York City, NY is the largest destination region for truck due to its large customer base. Detroit, MI conducts the most trade by rail both as an origin and as a destination, with New Orleans, LA conducting the most trade by water. As Texas is a center of oil production, the highest volume of pipeline trade both by origin and destination is recorded in Texas. When regions are compared by the \textit{fraction} of trade conducted by each mode of transportation, statistics change somewhat. North Dakota records the highest fraction of outbound rail and pipeline trade in the country, with Fresno, CA receiving a high proportion of inbound truck trade. Alaska has a high proportion of air trade both incoming and outgoing, as its position renders rail or truck trade to be difficult. \begin{figure}[h!] \centering \includegraphics[width = .8\linewidth]{Fraction_of_Shipments.png} \caption{Geographical distribution of subnational shipments} %ALT TEXT: Categorization of subnational U.S. regions according to how the volume of subnational trade traffic each region receives. \end{figure} \\ In Figure 2, I display the amount of subnational trade that passes through each FAF subnational region, either as an origin or as a destination. Los Angeles, CA is the region through which the most subnational shipments pass, comprising 2.553 trillion dollars and 6.34\% of the value of all subnational shipments. Chicago, Detroit, Dallas--Fort Worth, Houston, and Rest of Texas also contribute more than a trillion dollars of subnational trade. \\ \indent Large amounts of subnational shipments pass through the South and Midwest, with regions within Wisconsin, Iowa, Illinois, and Texas all for the most part contributing more than 500 billion dollars of subnational trade. Fewer shipments pass through the Western and Northeastern states, with the exceptions of major metropolitan areas in the Northeast (Boston, New York City, Philadelphia), and California. Results are inconsistent as to whether more trade passes through major metropolitan areas within a state, or the `rest of the state' as defined by the FAF which includes all counties not in that metropolitan area. For example, `Rest of Ohio' and `Rest of Pennsylvania' both process more than \$500 billion of subnational shipments, but the metropolitan areas in those states do not. In Washington, Minnesota, and the Mountain West, higher shipment values go through the major metropolitan areas than the counties outside these areas. Florida and New York have one metropolitan area that processes more shipments than non--metropolitan areas within the state, while other metropolitan areas within the state process fewer shipments.\\ \indent When comparing the geographical distribution of shipments among sectors, the identities of the regions processing the highest amount of shipments change substantially. For example, Detroit, MI processes 11.7\% of shipments in motor vehicles but 2.52\% of all subnational shipments, while Iowa processes 3.03\% of all shipments in agricultural goods but 1.27\% of all subnational shipments. However, regardless of the sector, a median .5\% of all subnational trade passes through each individual U.S. regions, with the most trade--intensive region contributing between five and ten percent. \begin{figure}[h!] \centering \begin{subfigure}[b]{0.4\linewidth} \includegraphics[width = \linewidth]{domsectpiechart.png} \caption{Domestic Destination} %ALT TEXT: breakdown of subnational trade by sector. Only for trade that does not come from outside the U.S. or leave the U.S. \end{subfigure} \begin{subfigure}[b]{0.4\linewidth} \includegraphics[width = \linewidth]{intsectpiechart.png} \caption{International Origin/Destination} %ALT TEXT: breakdown of subnational trade by sector. Only for trade that originates outside the U.S. or will be shipped outside the U.S. \end{subfigure} \caption{Subnational Trade Breakdown by Industry Category} \end{figure} \\ \indent Figure 3 displays the percentage of subnational trade belonging to major industry categories; I have further broken down the trade into shipments that a) start and finish within U.S. borders or b) either originate abroad or are eventually bound for a foreign destination. Energy goods, agricultural goods and pharmaceuticals are more commonly shipped domestically for domestic consumption, while subnational electronics shipments more often have a foreign origin or destination. Mixed freight accounts for nearly ten percent of purely domestic shipments, but a much smaller fraction of shipments that have an international component. \\ \indent Individual FAF sectors\footnote{Agriculture and energy are not individual FAF sectors, but rather amalgamations of sectors that appear separately in the FAF's categorization. Manufacturing sectors are in general broader than non--manufacturing sectors. For example, agricultural sectors include such categories as cereal grains, meat/seafood, and tobacco, while machinery is itself a category.} that account for a large fraction of subnational trade include motor vehicles, electronics, machinery, and pharmaceuticals. \begin{figure}[h!] \centering \includegraphics[width = .5\linewidth]{hhi.png} \caption{Histogram of the Herfindahl--Hirschman Index by Sector} %ALT TEXT: A histogram of the HHI for each sector, calculated according to the shares of trade in that sector from each region. \end{figure} \\ \indent I obtain an estimate of how decentralized trade is within each sector by computing a Herfindahl--Hirschman Index (HHI) using each FAF region's share of trade within that sector. HHIs below 1000 indicate a highly competitive industry, and as such trade in Figure 4 is generally quite decentralized, with most sectors having HHIs below 500. Tobacco, coal, and crude petroleum are the sole sectors with HHIs above 500, while tobacco is the sole sector to have an HHI above 1000, indicating moderate concentration. 25.7\% of tobacco shipments are sent from or received in Greensboro, NC, and 18.8\% of tobacco shipments are sent from or received in Richmond, VA. \section{Gravity Regression} \subsection{Theoretical Foundation} I assume that there are $K$ regions in the United States, and a representative consumer in city $i$ has utility function \begin{align*} \big(\sum\limits_{k=1}^K c_{kij}^{\frac{\theta_j-1}{\theta_j}}\big)^{\frac{\theta_j}{\theta_j-1}} \end{align*} for each type of good $j$. This utility function implies that the representative consumer will be spending \begin{align} E_{kij} = P_{kij}c_{kij} &= P_{ij}^{\theta_j-1}P_{kij}^{1-\theta_j}(P_{ij}c_{ij}) \end{align} on imports in sector $j$ from region $k$, where $P_{ij}$ is the aggregate price of sector $j$ goods in region $i$ and $P_{kij}$ is the specific price of goods in sector $j$ that originate in region $k$. \\ \indent Next, I assume that $P_{kij} = P_{kkj}\tau_{kij}$, where $P_{kkj}$ is the price of sector $j$ goods produced and sold in $k$ and $\tau_{kij}$ is the additional cost of shipping the good from $k$ to $i$. This $\tau_{kij}$ is the quantity of greatest interest. Taking logs of both sides of (1) yields the equation \begin{align*} \log(P_{kij}c_{kij}) &= (\theta_j-1)\log(P_{ij}) + (1-\theta_j)\log(P_{kkj}) + (1-\theta_j)\log(\tau_{kij}) + \log(c_{ij}) \end{align*} The left--hand side of the equation corresponds to the trade flows observed in the data. $\theta_j$ is the trade elasticity for sector $j$, and I can represent $P_{ij}$, $P_{kkj}$, and $c_{ij}$ with fixed costs since they do not vary across $k$ and $i$ simultaneously. The gravity equation then becomes \begin{align} \log(\text{Imports}_{kij}) &= \beta_0 + \sum\limits_{n=1}^N \beta_n X_n + \alpha_{ij} + \alpha_{kj} + \varepsilon_{kij} \end{align} where $\alpha_{ij}$ and $\alpha_{kj}$ are origin--sector and destination--sector fixed costs, respectively. These fixed costs account for regional factors that do not vary across origin--destination pairs, and I cluster standard errors by origin and destination. \\ \indent Unlike in Anderson and Van Wincoop (2003), the CES demand function introduced in this section does not contain expenditure weights across goods $k$ and does not normalize prices to one. This difference becomes irrelevant by the writing of the applied gravity regression equation (2), which is the same in both setups. This paper innovates through its application of a CES gravity structure to U.S. \textit{sub}national trade, and the different data framework required for such an estimation. \\ \indent A major problem with estimating subnational trade is that a significant fraction of locations do not trade with each other at the level of individual sectors. Feenstra and Hong (2023, working paper) addresses this issue by using a translog utility function to allow for zero trade levels between regions. However, with translog preferences, the measurement and interpretation of elasticities becomes less straightforward. A translog gravity regression as in Novy (2013) also requires the researcher to obtain measurements of the variety of goods available in each location, which is all but impossible for U.S. subnational regions. I therefore use a more straightforward CES consumer demand function and run the gravity regressions using a Pseudo--Poisson Maximum Likelihood (PPML) methodology.\footnote{Heckman (1979) proposes another method for estimating a log regression when many observations of the left--hand side variable are zeroes. However, his methodology does not allow for fixed effects to appear in the regression.} \subsection{Gravity Variables} \indent I model trade costs $\tau_{kij}$ as a function of the (logged) distance\footnote{I construct distance between U.S. regions using the NBER's County Distance Database, a data set of great--circle distances between U.S. counties based on internal points in each geographic area. I then aggregate the counties to form FAF geographic regions. For regions that are also metropolitan areas, I cross--reference the metropolitan area with a list of its constituent counties found in the Bureau of Labor Statistics' County--MSA crosswalk, and for regions that are outside metropolitan areas, I include the list of all counties in the relevant state that were not included in any of that state's metropolitan areas. \\ \indent I find the distance between regions by taking the average of the distances between the constituent counties in each regions. Mathematically, suppose that region 1 has $N_1$ counties and region 2 has $N_2$. The average distance between the two counties would then be \begin{align*} \frac{1}{N_1 N_2}\sum\limits_{i=1}^{N_1}\sum\limits_{j=1}^{N_2} \text{Distance}_{ij} \end{align*}} between regions and a binary variable indicating whether regions $k$ and $i$ are in the same state, as shipping goods across state lines may impose additional administrative or regulatory costs beyond the value of the distance. I also include gravity variables designed to be subnational counterparts to the language, currency, and other categories of binary variables commonly introduced into the standard gravity models that include multiple countries. I outline these variables below. \begin{itemize} \item Political Lean: Two regions with similar political views may be more likely to trade with one another than two regions that do not, due to shared cultural or business linkages. I obtain county--level 2020 vote totals for the two major parties from the MIT Election Data and Science Lab and aggregate these figures to get the total fraction of votes allocated to each major--party candidate at the regional level. The \textit{Political Lean} variable is equal to one for a pair of regions if both regions awarded more than sixty percent of their votes to the same candidate in 2020, and zero otherwise. \item Elevation: Since the majority of subnational shipments in the U.S. are transported by land, a substantially higher increase in elevation could inhibit trade between areas for a given level of distance. I include in the regression the product of the logged distance between regions and the average distance in elevation between those regions. I compute the average change in elevation between two regions with the same method that I use for computing the average distance between two regions.\footnote{This measure only takes into account the change in elevation between the origin region and destination region. If two regions were at the same elevation but are separated by a mountain range, the elevation variable would be zero.} \item Coastline: Two regions that both have access to the ocean may be more likely to trade with one another than two regions that do not both have access to the ocean. Alternatively, they might be less likely to trade with one another if trade with foreign nations is a substitute for shipments sent within the United States. I model this presence as a binary variable equal to one if the relevant FAF regions each border at least one of the following: the Atlantic Ocean, Pacific Ocean, or Gulf of Mexico. For example, if one region bordered the Pacific Ocean and one region bordered the Gulf of Mexico, their binary value is one. If one or both of the two regions does not border any ocean, their binary value is zero. \item Canada: The presence of a border with Canada may affect trade between two regions of the U.S. I model this variable as a binary variable equal to one if the FAF regions both border Canada and zero if at least one does not. I include regions that border one of the Great Lakes as regions that border Canada, since they may access Canada by water.\footnote{I was unable to model this border effect for Mexico because not enough FAF regions border Mexico. Out of the 17424 regional pairs, only forty--two included two regions that both have a Mexican border. I was likewise unable to model the Great Lakes and Canada separately.} \item Black/Hispanic: The presence of large minority communities may enhance trade between two U.S. regions due to cultural or business ties between those communities. The effect of these cultural ties is especially pronounced in the case of Hispanic communities, who may conduct business in Spanish. I model \textit{Black} and \textit{Hispanic} as separate variables.\footnote{I do not include an indicator variable for Asian population due to lack of variation; the median FAF region is only three percent Asian by population.} For each variable, I include in the regression a binary variable equal to one if the relevant minority group comprises at least ten percent of the population of both regions and zero if it comprises less than ten percent of at least one region's population. \\ \indent Minority population estimates are available from the U.S. Census Bureau at the county level, and I sum up county estimates by region to find the relevant population fractions in each region. I measure \textit{Black} as the fraction of the population that is Black or African American alone or in combination, and I measure \textit{Hispanic} as the fraction of the population that is Hispanic alone or in combination. I obtain population estimates by county and sum them up by region to find relevant population fractions in each region. The median FAF region is 11.1\% black and 10.6\% Hispanic. Laredo, TX has the highest concentration of Hispanic Americans at 95.3\%, and Memphis, TN has the highest concentration of Black Americans at 52.3\%. \item Region: The \textit{SameState} variable accounts for the effect of two regions being in the same state, but regional or cultural ties may also enhance trade between Census areas. I include in the regression a binary variable equal to one if both regions are part of the same broader Census area\footnote{States that comprise each Census area are as follows: New England (CT, MN, MA, NH, RH, VT, NJ, NY, PA); Midwest (IN, IL, MI, OH, WI, IA, KA, MN, MO, NE, ND, SD); South (DE, DC, FL, GA, MD, NC, SC, VA, WV, AL, KY, MS, TN, AR, LA, OK, TX); West (AZ, CO, ID, NM, MT, UT, NV, WY, AL, CA, HA, OR, WA).} and zero if at least one of the regions is not. \item Urban/Rural: Since urban regions and rural regions specialize in producing different types of goods, regions with different population concentrations may be more likely to trade with one another than regions with similar concentrations. I define a pair of regions as both being `urban' if they both have population densities above 500 people per square mile, and `rural' if they both have population densities below 100 people per square mile. I gather data on county--level populations and land areas from the Census Bureau and add them up to get population densities for the FAF regions. \\ \indent Unlike the other subnational gravity regressors, this variable is clearly vulnerable to reverse causality as trade patterns can obviously affect the concentration of population within a given geographic location. I avoid this issue by classifying regions into broader population density--based categories rather than including the population density itself as a regressor. While subnational trade could affect population movement across regions, the general categorization of regions as `urban' or `rural' is more fixed. The inclusion of this variable also removes a large source of omitted variable bias from the political lean variable, as the urban/rural nature of an area is closely related to its political orientation. \end{itemize} \section{Gravity Equation Results} I express all non--zero subnational shipment values in logs. \subsection{Aggregate Results} \begin{table}[h!] \centering \begin{tabular}{| c | c| c| c|} \hline & Aggregate & Pooled by sector & Pooled by mode \\ \hline Dist & -1.29*** & -.1.28*** & -1.40*** \\ \hline SameState & .746*** & .715*** & .704*** \\ \hline Political Lean & .137*** & .0885*** & .0735*** \\ \hline $\text{Elevation}\times\text{Dist}$ & $-9.61\times 10^{-5}$*** & $-5.10\times 10^{-5}$ & $-3.41\times 10^{-5}***$ \\ \hline Coastline & -.156* & -.150*** & -.128** \\ \hline Canada Border & -.0451 & -.0923 & -.0750 \\ \hline Black & .0400 & .120*** & .0641 \\ \hline Hispanic & .0174 & $2.01\times 10^{-3}$ & -.311*** \\ \hline Northeast & -.119*** & -.140 & -.0351 \\ \hline South & .0248 & .0471 & .0169 \\ \hline Midwest & .219** & .212*** & .223** \\ \hline West & -.103 & -.0910 & .0453 \\ \hline Urban & -.0862*** & -.106** & -.109* \\ \hline Rural & -.0528 & -.0298 & -.0580 \\ \hline Sector--region fixed effects & No & Yes & No \\ \hline Mode--region fixed effects & No & No & Yes \\ \hline Adjusted pseudo R--squared & .860 & .779 & .864\\ \hline \end{tabular} \caption{Gravity regression results, without dividing by sector or mode \\ ***: p $<$ .001; **: p $<$ .05; *: p $<$ .1} %ALT TEXT: Results of regressions without breaking down by sector or mode of transportation. \end{table} Before breaking down gravity results by sector or mode of transportation, I present results of gravity regressions on all subnational trade observations. I run the Column 1 regression on data aggregated across all sectors and modes, while Column 2 and Column 3 are run on pooled data across sectors and modes, respectively. I interact origin-- and destination--fixed effects with sectors in Column 2 and regions in Column 3. \\ \indent These preliminary results indicate that distance and state borders have strongly significant effects on subnational trade, with distance predicting lower trade and being in the same state predicting higher trade. Two regions trade more with each other if both are in the Midwest, while indicator variables for the other Census areas do not have a consistent effect on trade. \\ \indent Furthermore, the combination of subnational shipments across sectors and modes appears to introduce aggregation bias. Disaggregating across sectors causes the \textit{Black} indicator variable to become statistically significant at the one percent level, while the elevation and \textit{Northeast} cease to be statistically significant at any level. With these findings in mind, I now proceed to a set of gravity estimations broken down by sector. \subsection{Sector--Level Results} It is not possible for me to list results from all forty--two sector--level gravity regressions in a concise form. I will instead summarize the results of these regressions, then go into more detail about the coefficients on distance and \textit{SameState} as these results are larger and more significant. \\ \begin{table}[h!] \centering \begin{tabular}{| c | c| c| c| } \hline & Negative & Non--significant & Positive\\ \hline Distance & 41 & 1 & 0 \\ \hline Same State & 0 & 6 & 36 \\ \hline Political Lean & 1 & 31 & 10\\ \hline $\text{Elevation}\times \text{Distance}$ & 3 & 29 & 10 \\ \hline Coastline & 8 & 32 & 2 \\ \hline Canada Border & 6 & 32 & 4 \\ \hline Black & 2 & 34 & 6\\ \hline Hispanic & 4 & 33 & 5 \\ \hline Northeast & 7 & 28 & 7 \\ \hline South & 1 & 34 & 7 \\ \hline Midwest & 2 & 28 & 12 \\ \hline West & 2 & 33 & 7 \\ \hline Urban & 3 & 36 & 3 \\ \hline Rural & 9 & 30 & 3 \\ \hline \end{tabular} \caption{Summary of Gravity Regression Coefficients} %ALT TEXT: There are forty-two sector-level regressions. This table displays, for each gravity regressor, how many of the regressions have a negative and significant coefficient, how many had a positive significant coefficient, and how many coefficients are not statistically significant. \label{table:2} \end{table} For each variable, Table 4 shows the number of sectors, out of forty--two, that are either significantly negative, significantly positive, or not statistically significant at the ten percent level. Distance and \textit{SameState} have consistently significant impacts, with distance elasticities being negative and SameState elasticities being positive. Distance is only insignificant for transport equipment, and state borders are only insignificant for six sectors including cereal grains, coal, and crude petroleum. \\ \indent The remaining variables do not display statistically significant relationships with trade for around three quarters of sectors. This result suggests that these regional characteristics do not have a strong effect on subnational trade, especially compared to distance and state borders, but are still important for certain specific commodities Two regions that both share a coastline are less likely to trade with one another than two regions who do not, suggesting that coastal regions choose to trade with foreign nations instead of shipping goods to other regions of the U.S.\footnote{A region bordering the Pacific Ocean might be unlikely to trade with a region bordering the Atlantic Ocean due to the expense of shipping goods across the Panama Canal or around South America. However, the inclusion of distance in the regression controls for this issue. I do not make separate variables for the Atlantic and Pacific Ocean because very few of the regional pairs border the same ocean, and so the variable would have a lack of variation.} A similar political lean is more likely to predict higher trade than lower trade between regions, while rural status has the opposite effect as rural regions are unlikely to be centers of commerce for a large variety of sectors. Urban status, however, displays a positively significant relationship with trade as often as it does a negative one. \\ \indent While being in the same state strongly predicts higher shipment levels, being in the same broader Census area has much less predictive power on trade. Among the estimates that are significant, regional dummies more often have positively significant effects on trade than negatively significant effects, with Midwestern regional identity having the strongest positive relationship with subnational trade. The Northeast dummy variable, however, is just as likely to predict lower trade between regions as it is higher trade between regions. \\ \indent Energy goods such as coal, gasoline, or natural gas are generally shipped between regions with differing characteristics, as regions with high energy consumption needs are often not the same regions with large endowments of oil, gas, or coal. For example, areas with a similar political lean, areas in the Midwest and areas with a high concentration of black or Hispanic Americans all trade less crude petroleum with each other than areas that do not share these characteristics. Two regions will trade less coal, gasoline, and fuel oils if both regions are rural. However, in general, the gravity regressors in Table 4 are more likely to display statistically significant results for less frequently traded sectors than for more frequently traded sectors, as idiosyncratic local characteristics may have less impact on shipments in a commodity universally traded in large values across the country than shipments in a commodity whose scope is more limited. \begin{table}[h!] \centering \begin{tabular}{| c | c| c| c| c|} \hline Variable & Min & Median & Mean & Max \\ \hline Dist & -3.75 & -1.38 & -1.47 & -.368 \\ \hline SameState & -.416 & .816 & .778 & 1.84 \\ \hline Political Lean & -.493 & .167 & .0732 & .547 \\ \hline $\text{Elevation}\times\text{Dist}$ & $-2.87\times 10^{-3}$ & $1.89\times 10^{-4}$ & $-9.58\times 10^{-5}$ & $1.16\times 10^{-3}$ \\ \hline Coastline & -.916 & -.223 & -.173 & 1.74 \\ \hline Canada Border & -.921 & -.0189 & .171 & 1.62 \\ \hline Black & -.548 & .177 & .123 & 1.12 \\ \hline Hispanic & -.827 & .112 & .114 & 1.09 \\ \hline Northeast & -1.43 & .0102 & .309 & 4.82 \\ \hline South & -.866 & .328 & .337 & 1.59 \\ \hline Midwest & -1.07 & .267 & .238 & 1.44 \\ \hline West & -2.43 & .563 & .498 & 2.03 \\ \hline Urban & -1.71 & -.120 & -.0530 & 1.32 \\ \hline Rural & -1.37 & -.153 & -.217 & .413 \\ \hline \end{tabular} \caption{Summary statistics of gravity regression coefficients} %ALT TEXT: Distribution of regression coefficients for each gravity regressor. \label{table:2} \end{table} Just as the effects of distance and state borders on trade are more statistically significant than the other variables, Table 5 shows that they are larger in magnitude as well. Distance elasticities are well above one, and the median \textit{SameState} coefficient is close to one. \\ \indent As for other regressors, the positive effect of \textit{Black} is larger on average than that of \textit{Hispanic}, suggesting that despite the absence of language effects shared African--American demographics are more important for trade than shared Hispanic demographics. Regional effects on trade are largest for the West and weakest for the Northeast; the median effect on trade of two regions' both belonging to the Northeast is roughly zero. \\ \indent The magnitude of estimates in Table 5 are largely uncorrelated with sector--level trade frequencies. With regard to distance and state borders, the two regressors with the most explanatory power, regression estimates in a sector have nearly zero correlation with that sector's proportion of total subnational trade by value. For other regressors, the coefficients in some highly traded sectors are large in magnitude but not significant at the ten percent level, explaining why significance is somewhat related to trade frequency but magnitude is not. \\ \indent Finally, the results presented in this section underscore the importance of separating data by sectors when analyzing subnational trade. Although regional characteristics such as racial demographics or political affiliation have little impact on subnational trade in the aggregate or indeed for most sectors, they do have significant relationships on trade for certain particular sectors, a nuance lost when examining data exclusively at the aggregate level. Sector--level heterogeneity is especially valuable since many trade policy changes proposed by governments will not impact all sectors equally. \subsection{Distance and State Borders} \indent I will now go into more detail about the effect of distance and state borders on trade. Table 6 shows the coefficients on distance and a binary variable indicating whether both regions are in the same state. These results are shown both for trade by value and for trade by weight. \newpage \begin{table}[h!] \centering \begin{tabular}{| c | c | c| c| c| c|} \hline \multirow{2}{*}{Sector} & \multicolumn{2}{|c|}{Logged Trade by Value} & \multicolumn{2}{|c|}{Logged Trade by Weight} \\ \cline{2-5} & Distance & Same State & Distance & Same State \\ \hline Live animals/fish & -3.65*** & -.267* & -4.14*** & -.223\\ %& & (.159) & (.164) \\ \hline Cereal grains & -2.56*** & .866*** & -2.96*** & .827***\\ %& & (.078) & (.0957)\\ \hline Other agricultural products & -1.66*** & 1.14*** & -2.15*** & 1.50***\\ %& & (.0275) & (.0484)\\ \hline Animal feed & -1.63*** & .827*** & -2.19*** & .900***\\ %& & (.0288) & (.0525) \\ \hline Meat/seafood & -1.36*** & .396*** & -1.49*** & .377***\\ %& & (.0182) & (.0382) \\ \hline Milled grain products & -1.41*** & .0184 & -1.55*** & .239***\\ %& (.0251) & (.0520) \\ \hline Other foodstuffs & -1.39*** & .625*** & -1.73*** & .501***\\ %& (.0162) & (.0326) \\ \hline Alcoholic beverages & -1.18*** & 1.87*** & -1.48*** & 1.23***\\ %& & (.0284) & (.0545) \\ \hline Tobacco products & -1.46*** & .738*** & -1.35*** & .810***\\ %& & (.0442) & (.0818) \\ \hline Building stone & -1.87*** & 1.00*** & -2.49*** & 1.25***\\ %& & (.0446) & (.0807) \\ \hline Natural sands & -1.34*** & .972*** & -1.93*** & 1.58***\\ %& & (.0491) & (.0863) \\ \hline Gravel & -2.25*** & .849*** & -3.19*** & 1.47***\\ %& & (.0698) & (.0880) \\ \hline Nonmetallic minerals & -1.23*** & 1.03*** & -1.71*** & 1.27***\\ %& & (.0300) & (.0566) \\ \hline Metallic ores & -.825*** & 1.35*** & -1.42*** & 2.69*** \\ %& & (.0681) & (.152) \\ \hline Coal & -2.47*** & .548*** & -2.96*** & .251***\\ %& & (.0727) & (.103) \\ \hline Crude petroleum & -1.30*** & .278** & -1.49*** & -.150***\\ %& & (.0256) & (.0330) \\ \hline Gasoline & -2.22*** & .760*** & -2.33*** & .634***\\ %& & (.0505) & (.0802) \\ \hline Fuel oils & -1.85*** & 1.05*** & -2.06*** & .922***\\ %& & (.0434) & (.0636) \\ \hline Natural gas and other fossil products & -2.54*** & -.319*** & -3.17*** & -.480***\\ %& & (.0445) & (.0616)\\ \hline Basic chemicals & -1.03*** & .761*** & -1.54*** & .412***\\ %& & (.0241) & (.0500)\\ \hline Pharmaceuticals & -1.47*** & .858*** & -.891*** & 1.23***\\ %& & (.0308) & (.0612) \\ \hline Fertilizers & -1.64*** & 1.30*** & -1.97*** & 1.23***\\ %& & (.0470) & (.0838) \\ \hline Chemical products & -1.01*** & .530*** & -1.28*** & .537***\\ %& & (.0191) & (.0437) \\ \hline Plastics/rubber & -.981*** & .638*** & -1.03*** & .592***\\ %& & (.0143) & (.0301) \\ \hline Logs & -1.33*** & 1.20*** & -1.86*** & 1.30***\\ %& & (.0441) & (.0761) \\ \hline Wood products & -1.84*** & .540*** & -2.09*** & .639***\\ %& & (.0190) & (.0316) \\ \hline Newsprint/paper & -1.26*** & .552*** & -1.40*** & .531***\\ %& & (.0205) & (.0407) \\ \hline Paper articles & -1.43*** & .512*** & -1.68*** & .524***\\ %& & (.0204) & (.0411) \\ \hline Printed products & -.953*** & .418*** & -1.19*** & .778*** \\ %& & (.0200) & (.0476) \\ \hline Textiles/leather & -.808*** & .823*** & -1.11*** & .845***\\ %& & (.0146) & (.0355) \\ \hline Nonmetal mineral products & -1.53*** & .671*** & -2.15*** & .891***\\ %& & (.0195) & (.0195) \\ \hline Base metals & -1.40*** & .583*** & -1.56*** & .528***\\ %& & (.0173) & (.0321) \\ \hline Articles made from base metal & -1.22*** & .784*** & -1.47*** & .730***\\ %& & (.0176) & (.0340)\\ \hline Machinery & -.974*** & .835*** & -1.00*** & .943***\\ %& & (.0182) & (.0375)\\ \hline Electronics & -.824*** & 1.00*** & -1.09*** & .807***\\ %& & (.0156) & (.0347) \\ \hline Motorized vehicles & -.895*** & .886*** & -1.02*** & .757***\\ %& & (.0200) & (.0395) \\ \hline Transport equipment & -.384*** & 1.56*** & -.856*** & 1.92***\\ %& & (.0330)& (.0810) \\ \hline Precision instruments & -.683*** & 1.03*** & -1.03*** & 1.02***\\ %& & (.0206) & (.0500) \\ \hline Furniture & -1.02*** & .896*** & -1.21*** & .857***\\ %& & (.0160) & (.0336) \\ \hline Miscellaneous manufacturing products & -.727*** & 1.20*** & -1.34*** & .738***\\ %& & (.0133) & (.0306) \\ \hline Waste/scrap & -1.68*** & .670*** & -2.40*** & .786***\\ %& & (.0322) & (.0579) \\ \hline Mixed freight & -2.43*** & .257*** & -2.63*** & .347***\\ %& & (.0218) & (.0302) \\ \hline \end{tabular} \caption{Gravity regression results, by sector (***: p $<$ .001; **: p $<$ .05; *: p $<$ .1)} %ALT TEXT: Detailed breakdown of the relationship of distance with subnational trade and state borders with subnational trade. Each row represents the gravity regression for a particular sector. \label{table:2} \end{table} \newpage The second and third columns of Table 6 display trade elasticities with respect to distance and the presence of state borders for all forty--two sectors on which the FAF contains data. These findings suggest a consistent relationship between distance and trade at the subnational level, which contrasts with the inconsistent significance often found when estimating international gravity relationships. As the United States is a large single market with constitutional restrictions on barriers to interstate commerce, factors that complicate the gravity relationship at the international level may be absent within the United States. \\ \indent Overall, agricultural goods and raw materials have higher distance elasticities than manufactured goods, a finding explained by raw materials' higher depreciation costs and likelihood to spoil. All agricultural sectors (rows two through eight) have distance elasticities above one. Live animals and fish have the highest distance elasticity of all forty--two sectors, at -3.65. Sectors with low distance elasticities, meanwhile, include transport equipment, precision instruments, and miscellaneous manufacturing products, all of which have elasticities less than one with respect to distance. \\ \indent Being in the same state affects trade positively, with most SameState coefficients falling between one--half and one. Regions that display higher trade elasticities with respect to distance do not necessarily display higher elasticities with respect to state borders, as SameState elasticities are generally higher for manufactured goods than for raw materials or agriculture. Alcoholic beverages, miscellaneous manufacturing, and transport equipment show the highest elasticities with respect to state boundaries, implying the existence of stronger and more dispersed regulations that hinder trade between states in those sectors. Alcoholic beverages are more vulnerable than other sectors to state bureaucracy, as state policies differ with regard to the sale and distribution of alcohol. \subsection{Modes of Transportation} Since the number of transportation modes is lower than the number of sectors, I present mode--level gravity results in their entirety. \\ \begin{table}[h!] \centering \begin{tabular}{| c | c| c| c| c| c|} \hline & Truck & Rail & Water & Air & Pipeline \\ \hline Dist & -1.45*** & -.604*** & -1.04*** & .0662 & -2.34***\\ \hline SameState & .760*** & 1.12*** & $4.00\times 10^{-4}$ & .656 & -.472***\\ \hline Political Lean & .0949** & .218 & -.194 & -.364** & -.366**\\ \hline $\text{Elevation}\times\text{Dist}$ & $7.92\times 10^{-5}$ & $-4.95\times 10^{-4}$* & $-2.45\times 10^{-3}$*** & $3.82\times 10^{-4}$*** & $1.28\times 10^{-5}$ \\ \hline Coastline & -.136* & .450* & 1.06*** & -.505 & -.602**\\ \hline Canada Border & -.114* & -.254 & 1.67** & .213* & .102\\ \hline Black & .0791 & .122 & 1.36*** & -.754** & -.151 \\ \hline Hispanic & .0183 & -.402* & 1.26** & -.535*** & .129 \\ \hline Northeast & -.0133 & .728** & .847 & .796* & 2.81*** \\ \hline South & .113 & -.190 & -.477 & -.314** & .187 \\ \hline Midwest & .146** & -.818*** & -.775 & .879*** & .350* \\ \hline West & .0367 & .633 & 9.87*** & 1.21 & .998***\\ \hline Urban & -.0448 & -.555*** & -1.26*** & -.206 & -.296 \\ \hline Rural & -.0675 & -.0755 & -.0644 & -.297 & -.123 \\ \hline Adjusted psuedo R--squared & .875 & .599 & .821 & .711 & .803 \\ \hline \end{tabular} \caption{Gravity regression results, by mode of transportation} %ALT TEXT: Regression results for all gravity regressors. Each line is the regression for a particular mode of transportation. \label{table:2} \end{table} \indent Table 7 shows that distance elasticities are highest for pipeline trade, followed distantly by truck trade and water trade. If the distance between locations increases by one percent, those two locations are predicted to conduct 2.39 percent less trade by pipeline. However, distance is not important at all for trade shipped via airplane, with the coefficient being small and not statistically significant. All other distance elasticities are statistically significant at the one percent level. \\ \indent As with sectors, modes of transportation that show the greatest elasticity in magnitude with respect to distance do not necessarily display the greatest elasticities with respect to the SameState variable, and vice versa. Rail and truck transportation are most positively affected by two regions' being in the same state, and I speculate that truck and rail transportation, which rely on the manmade networks of highways and train tracks, are more affected by trade and travel regulations that may differ between states. Pipelines, meanwhile, show a negative relationship between trade and being in the same state. Pipelines transport oil across multiple states, and the transportation of oil via pipeline within a state might not be profitable. \\ \indent PPML estimations of international gravity have typically yielded distance elasticities between -.75 and -.95 (Anderson, Larch, and Yotov (2022); Yotov et al. (2016); Agnosteva, Anderson, and Yotov (2014); Santos Silva and Tenreyro (2006)). The rule of thumb for international gravity regressions more generally is that the distance elasticity will be around -1, with estimates including -.809 (Yotov, Larch, and Heid (2015)), -1.12 (Olivero and Yotov (2012)), or -1.2 (Helpman, Melitz and Rubinstein (2008)). My distance estimates are in the general range suggested by this literature, skewing slightly larger. The trade distances pertinent to subnational trade are generally smaller than the distances observed in international trade, and an increase in decrease of ten percent could make more of a difference if the original distance is 300 miles as opposed to 3000. \\ \indent Predictably, two regions that both share a coastline are more likely to trade with each over water and less likely to trade with each other by other means, with the intriguing exception of rail. A higher difference in elevation makes air travel more likely but rail and water travel less likely, implying that air transport may substitute for rail or water transport when the goods must be shipped over a large change in elevation. More intriguing are results showing that high minority representation in a pair of regions predicts higher water trade between those regions but lower trade by airplane.\\ \indent Regional identity has the strongest effect on trade in the Midwest; due to vast distances and relatively little rail infrastructure, two regions in the Midwest exchange more shipments by truck, air and pipeline and fewer shipments by rail compared to two regions that are not in the Midwest. However, the \textit{Midwest} dummy variable coefficient is not significant for water trade, despite the presence of the Great Lakes in several Midwestern states. The effect of `West' on water travel may be large because California and Washington conduct an amount of trade disproportionate to their geographic or population size. Although two regions' being in the same \textit{state} is negatively related to their trade via pipeline, their being in the same region has a significant effect on pipeline trade for all areas except the South. \subsection{Other Robustness Checks} \indent As a robustness check, I also perform gravity regressions using logged trade by volume (in tons) as the dependent variable instead of logged trade in value. Results of these regressions are reported in columns three and four of Table 5. Generally speaking, the ordinality of trade elasticities with respect to distance does not change much; live animals and fish are still the most elastic sector, and agricultural trade are more elastic with respect to distance than trade in manufactured goods. However, almost all sectors report higher elasticities for trade in tons than for the current value of trade. One explanation for this pattern is that exporters can respond to longer shipping distances by charging more, which would raise the current value of the shipment without altering the amount being shipped. \\ \indent For the majority of sectors, trade elasticities with respect to state borders are smaller for trade by weight than for trade by value. Pharmaceuticals and gravel have value elasticities well below one and weight elasticities above one, while electronics and miscellaneous manufacturing products have value elasticities above one and weight elasticities below one. Crude petroleum displays a negative elasticity for trade by weight and a positive elasticity for trade by value. \\ \indent Gravity regression results are virtually unaltered when using data from 2019, 2020, 2021, or 2022, so President Trump's trade wars and the pandemic did not have much of an impact on how regional characteristics affect subnational shipments. I also estimate gravity regressions measuring distance as a population--weighted average of the distance between each area's constituent counties rather than a simple unweighted average. This adjustment also did not cause any noticeable changes in the results. \\ \indent Lin and Sim (2012) show that the effect of logged distance on logged trade rises over time, with a coefficient of about -.75 in the 1950s and -1.25 in the 1990s. These estimates contradict my finding that gravity results vary very little over the past five years, but of course Lin and Sim are working with a much longer time period. \section{Trade Costs} \indent In Section 4.1, I introduce a theoretical foundation for estimating subnational gravity equations, in which I replaced logged trade costs with a linear function of logged distance and other bilateral regional characteristics. I can then rearrange this formulation to get trade costs for every pair $k$ and $i$. \begin{align*} \tau_{kij} &= \exp\big(\frac{\sum\limits_n \beta_n X_n}{1-\theta_j}\big) \end{align*} \indent I use this formula to back out regional pair--wise trade cost estimates broken down by sector and by mode of transportation. I estimate sector--level $\theta_j$ using data from the USITC's DataWeb. \\ \indent Due to the specification that $P_{kij} = P_{iij}\tau_{kij}$, I interpret a transportation cost $\tau_{kij}$ as the \textit{ceteris paribus} additional cost to region $i$ of consuming sector $j$ produced in region $k$ compared with the cost of consuming sector $j$ produced in region $i$ (in other words, consumption of goods produced locally). If $\tau_{kij}$ were 1.4, then a sector $j$ good produced in $k$ would be forty percent more expensive to a region $i$ consumer than a good produced in region $i$. To render this consumer indifferent between good $iij$ and good $kij$, region $k$ would need to possess some other advantage, such as higher productivity, that compensates for the additional costs imposed by trade. \subsection{Estimating $\theta_j$} I estimate Armington elasticities $\{\theta_j\}_{j=1}^{42}$ using a similar methodology to the gravity regression discussed in Section 4. A CES utility function gives us the regression \begin{align} \log(\text{Imports}_{kij}) &= (1-\theta_j)\log(f_{kij}) + \alpha_{ij} + \alpha_{kj} + \varepsilon_{kij} \end{align} where $\alpha_{ij}$ and $\alpha_{kj}$ are fixed effects, $f_{kij}$ is an international freight cost factor and $\varepsilon_{kij}$ is the error term. The difference between estimating this equation and estimating the trade costs in (2) is that here, I run a regression on international data to calibrate values of $\theta_j$, whereas I use (2) to extract subnational values of $\tau_{kij}$ given pre--calibrated values of $\theta_j$. \\ \indent The international data to estimate (3) comes from the USITC's DataWeb, and records sector--level import values for 225 exporting entities and forty--two ports of entry into the U.S in 2018. I calculate $f_{kij}$ as the ratio of the landed duty--paid value of the imports to their customs value, as in Riker (2022) and Riker (2019). Using the total trade values across all sectors, I obtain a value of $\theta = 6.82$, which lies in the middle of the 5--10 range suggested in Anderson and van Wincoop (2004). %Estimates of $\theta_j$ ranged from 1.53, for non--metallic mineral products, to 18.4, for motorized vehicles, with a median of 5.46 and mean of 6.60. These values roughly align with the range of 5--10 for Armington elasticities suggested in Anderson and van Wincoop (2004), although they suggest slightly less substitutability. The estimated value of 5.83 for furniture, mattresses, and lighting is not far from the analogous value of 4.15 estimated in Riker (2022). \\ %\indent DataWeb trade flows are not broken down by mode of transportation, so I obtain a $\theta$ to use in mode--wise trade costs by running a regression on data aggregated across sectors. In so doing, I obtain an elasticity of 6.82. \subsection{Trade Cost Results} All trade cost density plots and summary statistics discussed here exclude estimated trade costs between two regions that do not trade with each other. \\ \begin{figure}[h!] \centering \begin{subfigure}[b]{0.49\linewidth} \includegraphics[width = \linewidth]{tcostsag.png} \caption{Agricultural Goods} \end{subfigure} \hspace{1 mm} \begin{subfigure}[b]{0.49\linewidth} \includegraphics[width = .8\linewidth]{tcostsenergy.png} \caption{Energy Goods} \end{subfigure} \begin{subfigure}[b]{0.3\linewidth} \includegraphics[width = .95\linewidth]{tcosts2021.png} \caption{Chemicals/Pharmaceuticals} \end{subfigure} \begin{subfigure}[b]{0.3\linewidth} \includegraphics[width = 1.05\linewidth]{tcosts3435.png} \caption{Electronics/Machinery} \end{subfigure} \begin{subfigure}[b]{0.3\linewidth} \includegraphics[width = \linewidth]{tcosts36.png} \caption{Motor Vehicles} \end{subfigure} %ALT TEXT: Density plots of the estimated trade costs for selected highly traded sectors. \caption{Trade Cost Density Plots for Selected Sectors} \end{figure} \indent Trade cost estimates vary widely among sectors, with motorized vehicles having the lowest trade costs and non--metallic mineral products\footnote{This sector includes articles made of ceramics, glass, or asphalt, including household appliances, as well as concrete pipes and quicklime.} having the highest. Geographically, a shipment from Beaumont, Texas to the `Rest of Colorado' would incur the lowest transportation cost out of all regional pairs. \\ \indent Figure 5 shows density plots for five frequently traded sectors.\footnote{I truncate the long--tailed density plots to preserve image clarity. In all cases, the plots still show over ninety percent of the imputed trade costs, and the omitted trade costs correspond to pairs of regions that were not trading in that sector or mode in the first place.} Motorized vehicles, the most frequently traded sector, have a median trade cost of 1.40, meaning that shipping a car from one subnational region to another would, on average, add forty percent to the cost of the car. Agricultural goods generally have higher transportation costs than manufactured goods, reflecting their higher trade elasticity of trade with respect to distance discussed in Section 4. Cereal and milled grain products are especially costly to trade, with respective median trade costs of 339 and 136. Energy sector goods have transportation costs lower than agricultural goods but higher than manufactured goods, with crude petroleum having a median trade cost of 17.2 and gasoline having a median trade cost of 6.82. \\ \begin{figure}[h!] \centering \begin{subfigure}[b]{0.3\linewidth} \includegraphics[width = \linewidth]{tcoststruck.png} \caption{Truck} \end{subfigure} \begin{subfigure}[b]{0.3\linewidth} \includegraphics[width = \linewidth]{tcostsair.png} \caption{Air} \end{subfigure} \begin{subfigure}[b]{0.3\linewidth} \includegraphics[width = \linewidth]{tcostspipe.png} \caption{Pipeline} \end{subfigure} \begin{subfigure}[b]{0.495\linewidth} \includegraphics[width = .8\linewidth]{tcostsrail.png} \caption{Rail} \end{subfigure} \begin{subfigure}[b]{0.495\linewidth} \includegraphics[width = .8\linewidth]{tcostswater.png} \caption{Water} \end{subfigure} \caption{Trade Cost Density Plots for Different Modes of Transportation} %ALT TEXT: Density plots showing the distribution of estimated trade costs for each of the major modes of transportation. \end{figure} \indent As shown in Figure 6, goods shipped by airplane have the lowest trade cost, with the median only slightly higher than one. Rail shipping also imposes a fairly low trade cost. Truck shipping is more expensive, but pipeline transportation is the most costly mode by far, imposing a median trade cost of almost 1100 percent compared to a good produced locally. \\ \indent Water and rail transportation trade costs have a longer right tail than truck and pipeline transport, even though the median trade cost of truck and pipeline transportation is higher. This long tail can be explained by the distribution of different types of transportation within the United States. While rail or water may be easier for two regions directly connected by these two means of transportation, some regions have no rail or water connections at all, but the Interstate Highway System is ubiquitous and connects every region within the continental United States. The long right tail may also explain why truck shipments are so much more common than rail or water shipments despite having a higher median transportation cost. \\ \indent The trade cost estimates displayed in Figures 5 and 6, with the overall median of 5.00, are generally higher than trade costs estimated in the international trade literature. Anderson and van Wincoop (2004), which surveys the trade cost literature, determines that trade costs tend to be in the general range of 170\%, or $\tau \approx 2.7$. Egger, Larch, Nigai, and Yotov (2020) find a median logged transport cost of .9, implying that the median transport cost would be about 2.46. Anderson and van Wincoop (2003) estimate transport costs for U.S. states and Canadian provinces, finding them to be between 1.3 and 1.5 for all states; these transport cost measures incorporate both subnational and international trade. This contrast suggests that the relative cost of consuming locally--made goods versus goods sent from elsewhere in the United States is therefore higher than the relative cost of consuming American--made goods versus consuming foreign--made goods. A decreasing marginal effect of distance could drive this observed phenomenon; American--made goods already must be transported some distance even without international tariffs, and the extra distance contributed by overseas shipping does not add as much cost. Furthermore, the transport cost measures calculated here could be picking up some of the fixed cost of shipping, since I do not do any precise decomposition. Egger et al. (2020) do perform such a decomposition and find that the median ``log--scaled non--tariff transportation cost", a measure that does include fixed costs, is 6.86. \\ \indent Last, it should be noted that the trade cost computation method used in this section is more useful for comparing trade between two pairs of locales than looking at U.S. trade holistically. If distance were the most important criterion in predicting a region's trade accessibility, then coastal cities would be more expensive to ship to than cities that are farther inland and thus more centrally located within the country. We know this cannot be true, as coastal metropolitan areas such as Los Angeles account for some of the highest shares of subnational shipments. %\indent The ordinality between sectors could be caused by variation in the elasticities $\theta_j$, which were estimated using external data. To verify that our trade cost estimates are robust to our choice of elasticities, columns four through seven of Table 6 are trade costs estimated with the aggregate theta, $\overline\theta = 6.82$, where $\overline\theta$ is the elasticity estimated from DataWeb trade data summed across sectors. This alternate elasticity specification widens the dispersion of the trade cost estimates in Table 6, but preserves the ordinality. All minimum trade costs in Table 6 decrease compared to their original estimates and all maxima are greater (sometimes by wide orders of magnitude), but the costs of shipping goods in each sector continue to vary in the same order. \\ \section{Conclusion} \indent In this paper, I use data from the Freight Analysis Framework to estimate gravity regressions at the subnational level for the United States. The Freight Analysis Framework dataset encompasses 132 subnational regions, forty--two sectors, and eight major transportation categories. I make a novel contribution to the literature by estimating how gravity variables such as distance and state borders affect trade at the subnational level. \\ \indent Gravity regressions reveal a statistically significant negative relationship between logged distance and logged trade for all forty--two sectors and five major modes of transportation. Agricultural trade is generally more responsive to distance than trade in manufactured goods, with trade in energy goods falling between these two other categories. Among modes of transportation, truck and pipeline are most responsive to distance, with airplane trade not being much affected by distance. The data also show a statistically significant correlation between two regions' trade and their being in the same state, with few exceptions. \\ \indent Other regional variables such as political leanings, coastline, and demographics are significantly correlated with trade for around a quarter of the sectors included in the FAF data. A similar political lean or concentration of racial minorities predicts higher trade between regions more often than lower trade, while regions that both border the ocean are predicted to do less trade than two regions that do not border the ocean. Two regions that are part of the same Census area are more likely to exchange shipments than two regions that are not, but the correlation is not as strong as it is with two regions that are in the same state. \\ \indent I then employ the CES framework used to estimate gravity regressions to back out estimates of transportation costs. Trade cost estimates are 7.00 on average but vary widely between sectors, with motor vehicles having the lowest trade cost. Median trade costs are higher for truck shipments than rail, water, or air shipments, even though truck shipments comprise seventy percent of all U.S. subnational trade by value. \\ \indent Future research on this topic should do further analysis into some of the gravity relationships discussed in section 5, untangling the extent to which observed statistically significant relationships are causal. For example, similar political attitudes could enhance commerce and trade between two regions, but there is no causal reason why regions of a similar political bent would be more likely to trade manufactured goods than agricultural goods, without considering related factors such as population density. Future subnational gravity research should also considering incorporating gravity regressors that vary by sector or mode of transportation as well as region. However, reverse causality could pose a major challenge with incorporating such variables into the regression. Sector--level employment shares, for example, would definitely be influenced by sector--level subnational shipments. \\ \indent Finally, if such data were available, an interesting exercise would be to extend the analysis presented here to a country where subnational regions can levy tariffs on each other. The Commerce Clause of the U.S. Constitution establishes a free trade zone among U.S. states, but in other countries, such as Brazil and India, regional governing bodies can and do impose regional trade restrictions. Subnational gravity results may differ in the presence of such administrative trade barriers. \section{References} Acemoglu, Daron, David Autor, David Dorn, Gordon H. Hanson, and Brendan Price. ``Import Competition and the Great US Employment Sag of the 2000s." \textit{Journal of Labor Economics} 34, no. S1. (January 2016): S141--98. \\ Agnosteva, Delina, James E. Anderson, and Yoto Yotov. ``Intra--national Trade Costs: Measurement and Aggregation." \textit{European Economic Review} 112 (2019): 32--50. \\ Allen, Treb, and Costas Arkolakis. ``Trade and the Topography of the Spatial Economy." \textit{The Quarterly Journal of Economics} 129, no. 3. (August 1, 2014): 1085--1140. \\ Allen, Treb, and Costas Arkolakis. ``The welfare effects of transportation infrastructure improvements." \textit{Review of Economic Studies} 89, no. 6 (2022): 2911--2957. \\ Anderson, James E. and Eric van Wincoop. ``Gravity with Gravitas: A Solution to the Border Puzzle." \textit{The American Economic Review} 93, no. 1 (March 2003): 170--192. \\ Anderson, James E. and Eric van Wincoop. ``Trade Costs." \textit{Journal of Economic Literature} 42, no. 3 (September 2004), 691--751. \\ Autor, David H, David Dorn, and Gordon H Hanson. ``The China Syndrome: Local Labor Market Effects of Import Competition in the United States." \textit{American Economic Review} 103, no. 6. (October 1, 2013): 2121--68. \\ Caliendo, Lorenzo, Maximiliano Dvorkin, and Fernando Parro. ``Trade and Labor Market Dynamics: General Equilibrium Analysis of the China Trade Shock." \textit{Econometrica} 87, no. 3. (2019): 741--835. \\ Caliendo, Lorenzo, and Fernando Parro. ``Trade Policy." NBER Working Paper 29051. \\ Donaldson, Dave, and Richard Hornbeck. ``Railroads and American Economic Growth: a `market access' Approach." \textit{The Quarterly Journal of Economics} 131, no. 2 (2016): 699--858. \\ Egger, Peter, Mario Larch, Sergey Nigai, and Yoto Yotov. ``Trade costs in the global economy: Measurement, aggregation, and decomposition." WTO Staff Working Paper, 2021. \\ Feenstra, Robert, and Chang Hong. ``Estimating the Regional Impact of Tariff Changes: Application to the United States." Working Paper, 2023. \\ Fuchs, Simon, and Woan Foong Wong. ``Multimodel transport networks." Federal Reserve Bank of Atlanta Working Paper, 2023. \\ Gurevich, Tamara, and Peter Herman. ``The Dynamic Gravity Dataset: 1948--2016." U.S. International Trade Commission Working Paper Series, February 2018. \\ Hakobyan, Shushanik, and John McLaren. ``Looking for Local Labor Market Effects of NAFTA." \textit{Review of Economics and Statistics} 98, no. 4. (October 2016): 728--41. \\ Heckman, James J. ``Sample Selection Bias as a Specification Error." \textit{Econometrica} 47, no. 1. (January 1979): 153. \\ Heid, Benedikt, Mario Larch, and Yoto Yotov. ``Estimating the effects of non--discriminatory trade policies within structural gravity models." \textit{Canadian Journal of Economics} 54, no. 1 (February 2021): 376--409. \\ Helpman, Elhanan, Marc Melitz, and Yona Rubinstein. ``Estimating Trade Flows: Trading Partners and Trading Volumes." \textit{Quartrely Journal of Economics} 123 (2008): 441--487. \\ Jaworski, Taylor, Carl Kitchens, and Sergey Nigai. ``Highways and Globalization." \textit{International Economic Review} 64, no. 4 (2023): 1615--1648. \\ Lin, Faqin, and Nicholas C.S. Sim. ``Death of Distance and the Distance Puzzle." Economics Letters 116, no. 2. (August 2012): 225--228. \\ Olivero, Maria, and Yoto Yotov. ``Dynamic Gravity: endogenous country size and asset accumulation." \textit{Canadian Journal of Economics} 45, no. 1 (February 2012): 64--92. \\ Novy, Dennis. ``International Trade without CES: Estimating Translog Gravity." \textit{Journal of International Economics} 89, no. 2. (March 2013): 271--282. \\ Riker, David. ``Estimating U.S. Import Penetration in Sub-National Regions: International Trade and Economic Development." \textit{Journal of International Trade and Economic Development} 29, no. 7. (2020): 891-906. \\ Riker, David. ``State-Level Import Penetration." U.S. International Trade Commission Working Paper Series, March 2022. \\ Santos Silva, J.M.C., and Silvana Tenreyro. ``The Log of Gravity." \textit{The Review of Economics and Statistics} 88, no. 4 (2006): 641--658. \\ Yotov, Yoto V., Roberta Piermartini, Jos\'e-Antonio Monteiro, and Mario Larch. ``An Advanced Guide to Trade Policy Analysis: The Structural Gravity Model." WTO, November 15, 2016. \end{document}