I created the “Second American car database” in 2017, sourcing data from TheCarConnection.com, and updated it until December 2019. Updates are technically still possible, but not worthwhile because the source website is inconsistent and no longer has data for ALL cars.
People interested in new models can buy the first American car database, which I continue to update every 2 months.
Download free SAMPLES:
American-Car-Database-No-Specs-by-Teoalida-SAMPLE.xls (5 columns)
American-Car-Database-Basic-Specs-by-Teoalida-SAMPLE.xls (28 columns)
American-Car-Database-Full-Specs-by-Teoalida-SAMPLE.xls (210 columns)
Buy complete database + FREE updates for one year:
61 car makes included
Acura, Alfa Romeo, AM General, Aston Martin, Audi, Bentley, BMW, Buick, Cadillac, Chevrolet, Chrysler, Coda, Daewoo, Dodge, Eagle, Ferrari, FIAT, Fisker, Ford, Genesis, Geo, GMC, Honda, HUMMER, Hyundai, INFINITI, Isuzu, Jaguar, Jeep, Kia, Lamborghini, Land Rover, Lexus, Lincoln, Lotus, Maserati, Maybach, Mazda, McLaren, Mercedes-Benz, Mercury, MINI, Mitsubishi, Nissan, Oldsmobile, Panoz, Plymouth, Pontiac, Porsche, Ram, Rolls-Royce, Saab, Saturn, Scion, smart, Subaru, Suzuki, Tesla, Toyota, Volkswagen, Volvo.
History
The first database I made for the American market (Year-Make-Model-Trim-Specs, offered since 2014) was missing some essential info, for example MSRP (manufacturer’s suggested retail price). In October 2017 one customer interested in MSRP suggested that I extract data from TheCarConnection.com, then in November 2017 another customer asked me for specific technical specs available on TheCarConnection, so I included all specifications. After selling privately to 2 customers, both interested in 2017-2018 models only, I decided to scrape ALL data for 1990-2018 and make the database public for everyone, naming it “Second American car database”.
The URLs were not constant over time, and TheCarConnection occasionally deleted cars by merging cars with similar specs. In mid-2019 they added a unique ID for each car, but they also deleted nameplates discontinued before 2015, so the December 2019 database had about 20% fewer records than the previous one. I provided 2 files, for June 2019 (58006 cars) and December 2019 (46107 cars), and some customers had trouble combining the new file with the older file to have ALL cars in 1 database.
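For customers who want to try combining the two files themselves, here is a minimal sketch in Python. It is not the method used to build the database. It assumes each file has been loaded as a list of rows with columns named Make, Model, Year and Trim (hypothetical names; check the actual headers), and it uses those four columns as the key because URLs were not constant and IDs exist only from mid-2019. The rows below are made-up placeholders.

```python
def record_key(row):
    """Case-insensitive key built from naming columns (column names are assumptions)."""
    return tuple(str(row.get(col, "")).strip().lower()
                 for col in ("Make", "Model", "Year", "Trim"))


def combine_releases(older_rows, newer_rows):
    """Union of two releases; when a key exists in both, the newer row wins."""
    combined = {record_key(row): row for row in older_rows}
    for row in newer_rows:
        combined[record_key(row)] = row
    return list(combined.values())


# Placeholder rows for illustration only, not real database content.
june = [
    {"Make": "Dodge", "Model": "Caravan", "Year": 2005, "Trim": "SE", "Source": "June 2019"},
    {"Make": "Honda", "Model": "Civic", "Year": 2018, "Trim": "LX", "Source": "June 2019"},
]
december = [
    {"Make": "Honda", "Model": "Civic", "Year": 2018, "Trim": "LX", "Source": "December 2019"},
    {"Make": "Honda", "Model": "Civic", "Year": 2020, "Trim": "LX", "Source": "December 2019"},
]

all_cars = combine_releases(june, december)
```

This keeps cars that exist only in the June file and takes the December version of cars present in both. It cannot detect a trim that the source website renamed or merged between releases: such a car has a different key in each file and will appear twice, so the result still needs manual checking.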
Accuracy notes
The second American database is sold “as is”, without corrections or additions, because any changes I make would be lost at the next update.
While specifications for each individual car are pretty accurate, filtering cars by class is troublesome due to multiple values with the same meaning: “midsize car”, “midsize cars”, “mid-size cars”. The model hierarchy was slightly messed up: for example, Jetta is 1996-2005, 2009-2014, 2017-2018, New Jetta is 1999, Jetta Sedan is 2002-2016, etc. (they corrected this in mid-2019).
The first American database has nearly perfect accuracy and constant unique IDs, and nothing is deleted. I also improved the model hierarchy, corrected errors and expanded the database with additional data not available on Edmunds. In May 2018 I made a more complex JSON scraper (instead of HTML) and added MSRP (for 2001-present cars) to the first American database, which has been outselling the second American database by 10 times, so keeping updates ongoing for 2 databases targeting the same market was no longer worthwhile.
The only advantages of the second American database are MSRP for 1990-2000 models, gear ratios, dimensions for cargo area / trunk bed (length, width between wheel arches, height to roof), etc. But very few people are interested in these specs.
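Buyers who need to filter by class can collapse such variants themselves. The sketch below is only an illustration based on the three values quoted above; the list of plural nouns is an assumption, and the real file may contain other variants that need extra rules.

```python
import re


def normalize_class(label):
    """Map spelling variants of a car class to a single form."""
    text = re.sub(r"\s+", " ", label.strip().lower())
    # "mid-size" / "mid size" -> "midsize"
    text = re.sub(r"\bmid[- ]size\b", "midsize", text)
    # Plural vehicle nouns -> singular (noun list is an assumption)
    text = re.sub(r"\b(car|truck|van|suv)s\b", r"\1", text)
    return text


variants = ["midsize car", "midsize cars", "mid-size cars"]
normalized = {normalize_class(v) for v in variants}
```

After normalization all three variants collapse into one value, so a filter on class returns the same cars no matter how the source website spelled it.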
List of updates
In 2017 TheCarConnection was providing data for 1997-present models. As of March 2018 they added 1996 models with specifications and also 1990-1995 models, but only naming and MSRP, without specifications. I do not know if they will add specifications for pre-1996 cars too in the future.
59 makes, 1136 models, 7224 model years, of which 6608 with specs, ? with trims, 46140 trims, as of December 2017.
61 makes, 1191 models, 8390 model years, of which 8074 with specs, 6644 with trims, 46804 trims, as of 12 March 2018, published 18 March.
61 makes, 1210 models, 8651 model years, of which 8313 with specs (? full specs), 56363 trims (? full specs), as of 20 September 2018.
61 makes, 1215 models, 8694 model years, of which 8313 with specs (? full specs), 56369 trims (? full specs), as of 26 December 2018 (the source website did not add more specs in 3 months?).
61 makes, 1225 models, 8762 model years, of which 8439 with specs (7008 full specs), 57181 trims (49381 full specs), as of 1 Feb 2019.
61 makes, 1229 models, 8790 model years, of which 8454 with specs (7023 full specs), 57312 trims (49512 full specs), as of 28 March 2019.
61 makes, 1235 models, 8877 model years, of which 8547 with specs (7116 full specs), 58006 trims (50209 full specs), as of 4 June 2019.
In mid-2019 TheCarConnection deleted the pages of all model nameplates discontinued before 2015. The 1990 Dodge Grand Caravan is still visible, because a new model with the same name is still in production today, but the 1990-2007 Dodge Caravan (non-Grand) disappeared because this nameplate is no longer available today. Strange decision!
I attempted to extract all cars and add 2019-2020 models on top of the 1990-2018 models from the June 2019 release, but I did not find enough new cars until 18 December, when I found more 2020 models but fewer 2018 and 2019 models than in the previous runs of September and December. So besides adding new cars, they are also changing old cars, deleting them or merging 2 similar trims into one. Consequently, I provide 2 separate tables for the June and December 2019 updates until I take a further decision.
521 models, 5629 model years, of which 5449 with specs, 44507 trims, as of 16 Sep 2019.
525 models, 5696 model years, of which 5460 with specs, 44558 trims, as of 07 Dec 2019.
554 models, 5785 model years, of which 5661 with specs, 46108 trims, as of 15 Dec 2019.
46 makes, 538 models, 5786 model years, of which 5660 with specs, 46107 trims, as of 24 Dec 2019. This was the LAST published update.
TheCarConnection does add new 2021, 2022, 2023 models, but no longer has ALL models. Scraping is technically still possible, and I did run the scraper to get the number of models, without scraping each individual trim page for specifications, because it is not worthwhile to publish updates that are missing old models.
38 makes, 472 models, 5510 model years, of which 5392 with specs, 47493 trims, as of 30 May 2020. Exotic manufacturers were removed (Aston Martin, Bentley, Ferrari, Lamborghini, Rolls-Royce, etc.).
38 makes, 476 models, 5544 model years, of which 5419 with specs, 5407 with trims, 47724 trims, as of 1 July 2020.
38 makes, 476 models, 5646 model years, of which 5532 with specs, 5522 with trims, 49019 trims, as of 18 October 2020.
39 makes, 454 models, 3632 model years, of which 3575 with specs, 3574 with trims, 33711 trims, as of 10 August 2022. Model years older than 2008 were removed.
37 makes, 443 models, 2122 model years, of which 2038 with specs, 2034 with trims, 17251 trims, as of 30 June 2023.
I need a database that tells me what size class of vehicle customers have when they call. Example Small, mid-size, Full-Size in the US
The above American car database does have car class as sourced from the original website, which is NOT accurate; for example, it has multiple variations like “midsize car”, “midsize cars”, “mid-size cars”.
I suggest buying one of these 2 alternative databases:
https://www.teoalida.com/cardatabase/year-make-model/
https://www.teoalida.com/cardatabase/year-make-model-trim-specs/
Both have car class manually added by me CORRECTLY instead of being sourced “as is” from a dodgy website.
Where can I find the makes for Heavy trucks, i.e. Peterbilt. Thanks!
If you find any website about trucks similar to KBB or Edmunds, I can make an Excel database based on it, for you and other people who asked the same thing.
Hello,
I’m interested in the full American database. Do you have the bolt patterns for the wheels available? I did not see (or missed) them in the sample columns.
Thanks!
I’m looking for cargo capacity for vehicles 10 years old or newer – vans, cars, trucks etc. What’s the best DB for getting this info?
https://www.teoalida.com/cardatabase/year-make-model-trim-specs/ has normal cargo capacity for 64% of cars and maximum cargo capacity for 42% of cars (only volume).
This Second American car database has 20 columns such as cargo length, width, height (in addition to volume), but none of them has more than 30% completion.
Hello, our engineering team is trying to load the databases you provided and we are running into a problem with the July 2019 and Dec 2019 spreadsheets. The Dec spreadsheet doesn’t contain some of the vehicles that are on the July spreadsheet. We cannot combine the spreadsheets because that would cause duplicates. Are you able to provide one spreadsheet that encompasses both Dec and July vehicles?
You left your comment on the WRONG page. At the bottom of https://www.teoalida.com/cardatabase/american/ I clearly explained the reason why I provide 2 files and the problems I have in combining them. If you have any solution, let me know.
Since you didn’t purchase the full specs database, which would show that you needed all 230 columns, but trim/naming only, what was the reason for purchasing https://www.teoalida.com/cardatabase/american/ instead of the higher quality database https://www.teoalida.com/cardatabase/year-make-model-trim-specs/ that does not have such problems?
Hi, responding on the correct page now that you have directed me to it. We purchased the 1990-2020 no specs option below because we don’t need all of the extra specs to get a quote for our system. Is the basic specs option for $290.03 in one file, or does this have the same issue?
I copied your earlier comments here so other people can understand what we were talking about.
No specs, basic specs, full specs… the only difference is the number of columns. All packages of the “Second American car database” are affected by this strange decision of TheCarConnection to remove all discontinued model nameplates from the website in mid-2019. I just noticed that they added back the Saturn brand and a few more discontinued cars, such as the Ford Bronco discontinued in 1996, but the Saturn L-Series available in the July 2019 update is still 404 Not Found: https://www.thecarconnection.com/specifications/saturn_l-series_2000
If TheCarConnection continues to delete and add back old cars, this database is NOT a reliable long-term solution in terms of consistent successive updates, compared with https://www.teoalida.com/cardatabase/year-make-model-trim-specs/ based on Edmunds, who never delete cars once added.
Ok, to avoid the 2 spreadsheets, we should purchase this database instead? https://www.teoalida.com/cardatabase/year-make-model-trim-specs/ and this database would be one spreadsheet no matter if we choose the no specs, basic, or full? Sorry for the confusion and thanks for your help!
Hi, just wanted to check back on my above questions?
I thought that I had already explained in my previous 2 comments that https://www.teoalida.com/cardatabase/year-make-model-trim-specs/ does not have such problems at updates, because Edmunds never deletes old cars, so you will always get one file (unless you specifically ask me to also give you an older file).
I don’t know why you are still asking the same thing again. If you have more doubts, use the live chat in the lower-right corner of the screen so we can talk faster.
What about specs on “navigation, sunroof, heated seats etc…