The original release of the Patent Examination Research Dataset (PatEx) contained detailed information on 9.2 million publicly viewable patent applications filed with the USPTO through December 2014. The first three updates to the dataset are available as well, the most recent posted in December 2018 (and referred to as the 2017 release). This release covered all activity through 2017, but also includes activity through mid July of 2018.
The latest version of PatEx (referred to below as the 2019 release) contains detailed information on 11.3 million publicly-viewable provisional and non-provisional patent applications to the USPTO and nearly 4.2 million Patent Cooperation Treaty (PCT) applications. It is based on data that OCE downloaded from the Patent Examination Data System (PEDS) on April 26, 2020. The PEDS data are sourced from Public PAIR. This is the first time that OCE has used PEDS as the basis of PatEx. We took the PEDS data and organized it into the familiar PatEx data files, which are based on the organization of the Public PAIR portal. The data files include information on each application’s characteristics, prosecution history, continuation history, claims of foreign priority, patent term adjustment history, publication history, and correspondence address information. However, there are some minor differences between the new PatEx release and the previous ones. Because of this, we provide new technical documentation for the 2019 release, which can be found here.
The OCE developed these data files for public use and encourage users to identify fixes and improvements. Please provide all feedback to >EconomicsData@uspto.gov.
Documentation
Original Documentation (For 2014 through 2017 Releases)
A document describing these data sets is available and can be cited as: Graham, Stuart J.H. and Marco, Alan C. and Miller, Richard, The USPTO Patent Examination Research Dataset: A Window on the Process of Patent Examination (November 30, 2015). Available at SSRN: https://ssrn.com/abstract=2702637.
Understanding how patent examination records become public is crucial to the proper analysis of the PatEx data. Thus, the document focuses primarily on the coverage of the underlying Public PAIR data and how it has evolved over time. It also includes several appendices that provide more detailed descriptions of the data elements in each of the files. These appendices can be accessed separately by clicking on the following links.
Appendix A: Description of the Application Data Tab Release
Appendix B: Description of the Transaction History Tab Release
Appendix C: Description of the Continuity Data Tab Release
Appendix D: Description of the Foreign Priority Tab Release
Appendix E: Description of the Patent Term Adjustment Tab Release
Appendix F: Description of the Address and Attorney/Agent Tab Release
Notes Regarding 2015 PatEx Data Files
New Technical Documentation (For 2019 Release)
However, if you are using the 2019 release, you should disregard the appendices above and refer to the new technical documentation. Please refer to the following technical documentation for the 2019 release: Miller, Richard D. Technical Documentation for the 2019 Patent Examination Research Dataset (PatEx) Release. USPTO Economic Working Paper No. 2020-4. Available here: https://www.uspto.gov/sites/default/files/documents/PatEx-2019-Technical-Doc.pdf.
Additional resource for the PatEx data is the paper, "USPTO Patent Prosecution and Examiner Performance Appraisal", and can be cited as: Marco, Alan C. and Toole, Andrew A. and Miller, Richard and Frumkin, Jesse, USPTO Patent Prosecution and Examiner Performance Appraisal (June 1, 2017). USPTO Economic Working Paper No. 2017-08. Available at SSRN: https://ssrn.com/abstract=2995674 or http://dx.doi.org/10.2139/ssrn.2995674
Data Files
Each of the files below can be downloaded in either Stata-14 (DTA) or CSV format.
Download a full set of data files (2014): [.dta format (5.42 GB)] [.csv format (4.33 GB)]
Download a full set of data files (2015): [.dta format (5.56 GB)] [.csv format (4.99 GB)]
Download a full set of data files (2016): [.dta format (4.98 GB)] [.csv format (4.36 GB)]
Download a full set of data files (2017): [.dta format (5.37 GB)][.csv format (4.8 GB)]
Download a full set of data files (2019): [.dta format (9.4 GB)] [.csv format (7.87 GB)]
Download individual data files (the direct download pages are here: 2014, 2015, 2016, 2017, 2019).
File Name | 2014 | 2016 | 2017 | 2019 | ||||
---|---|---|---|---|---|---|---|---|
application_data | DTA 1.53 GB | CSV 585 MB | DTA 1.1 GB | CSV 681 MB | DTA 1.01 GB | CSV 657 MB | DTA 1.03 GB | CSV 774 MB |
all_inventors | DTA 229 MB | CSV 225 MB | DTA 348 MB | CSV 347 MB | DTA 485 MB | CSV 499 MB | DTA 427 MB | CSV 417 MB |
transactions | DTA 2.55 GB | CSV 2.45 GB | DTA 2.02 GB | CSV 1.91 GB | DTA 2.21 GB | CSV 2.09 GB | DTA 2.56 GB | CSV 1.65 GB |
event_codes | DTA 75 KB | CSV 21.2 KB | DTA 36.4 KB | CSV 22.8 KB | DTA 37.8 KB | CSV 23.5 KB | DTA 40.7 KB | CSV 23.3 KB |
status_codes | DTA 8.56 KB | CSV 3.53 KB | DTA 5.87 KB | CSV 3.74 KB | DTA 6.01 KB | CSV 3.74 KB | No data | No data |
continuity_parents | DTA 49.9 MB | CSV 48.7 MB | DTA 73.2 MB | CSV 58 MB | DTA 79 MB | CSV 63.1 MB | DTA 102 MB | CSV 80.2 MB |
continuity_children | DTA 40.9 MB | CSV 40.9 MB | DTA 47.9 MB | CSV 47.7 MB | DTA 51.9 MB | CSV 51.6 MB | DTA 63.6 MB | CSV 61.3 MB |
foreign_priority | DTA 36.5 MB | CSV 35.2 MB | DTA 40.7 MB | CSV 39.4 MB | DTA 43.8 MB | CSV 41.5 MB | DTA 77 MB | CSV 47 MB |
pat_term_adj | DTA 823 MB | CSV 747 MB | DTA 1.12 GB | CSV 1.01 GB | DTA 1.22 GB | CSV 1.11 GB | DTA 1.28 GB | CSV 1.53 GB |
pta_summary | DTA 19.6 MB | CSV 16.2 MB | DTA 25.1 MB | CSV 20.1 MB | DTA 27.5 MB | CSV 22 MB | DTA 49.3 MB | CSV 33.1 MB |
pte_summary | No data | No data | No data | No data | No data | No data | DTA 531 KB | CSV 345 KB |
correspondence_address | DTA 165 MB | CSV 243 MB | DTA 236 MB | CSV 280 MB | DTA 276 MB | CSV 299 MB | DTA 350 MB | CSV 362 MB |
attorney_agent | No data | No data | No data | No data | No data | No data | DTA 3.49 GB | CSV 2.96 GB |
Additional Resources
A good primer for the art of patent examination is the Manual of Patent Examining Procedure.