oscar-corpus.com


oscar-corpus.com Website Info

oscar-corpus.com (Open Source Project on Multilingual Resources for Machine Learning) was registered first at 2020-05-01 12:56:14. DNS looks Active and website looks Accessable. oscar-corpus.com Website SEMRush Rank is 19,575,196. According to Google, website speed score is 0/100 and . Website looks safe for children. We detected the website language as en-us.
oscar-corpus.com


oscar-corpus.com Website Tags

Domain Status:
✓ Active
Is Site Accessable?:
✓ Yes
SSL(https):
✓ Yes
Title:
OSCAR
Description:
Open Source Project on Multilingual Resources for Machine Learning
Categories :
Education/Reference, Business
External Links:
125
Internal Links:
166
Canonical URL:
https://oscar-project.org/
Language:
en-us
XML Sitemap:
✓ Yes
robots.txt:
✓ Yes https://www.oscar-corpus.com/robots.txt
Favicon:
✓ Yes

oscar-corpus.com Domain & Whois Details

Refresh
Domain Create Date:
2020-05-01 12:56:14
Domain Age:
5 years, 2 months, 3 days
Domain Expire Date:
2024-05-01T12:56:14Z
Domain Last Update Date:
2024-05-01T12:56:14Z
Domain Owner:
http://domains.google.com - Google LLC -
Server Type:
GitHub.com
Nameservers:
NS-CLOUD-A1.GOOGLEDOMAINS.COM - NS-CLOUD-A2.GOOGLEDOMAINS.COM - NS-CLOUD-A3.GOOGLEDOMAINS.COM - NS-CLOUD-A4.GOOGLEDOMAINS.COM -
Hosting Location:
Country:United States, City:Colbert, Isp:Google LLC, Org:Google LLC
IP:
142.250.176.211

oscar-corpus.com Backlinks & Rankings

SEMRush Rank:
19,575,196
Semrush Rank is a proprietary score that lets you find the domains that are getting the most traffic from organic search.
SEMRush URL Links:
0
Number of links to URL according to SemRush.
SEMRush Website Links:
505
Number of links to the website according to SemRush.
SEMRush Domain Links:
505
Number of links to SemRush Domain.
SEMRush Keywords In Top 100:
32
Number of keywords where site in Google's organic search top 100.

oscar-corpus.com Social Media

Facebook Comments:
0
Facebook Shares:
1
Facebook Reactions:
0

oscar-corpus.com Website Speed (Desktop) Check Now

Speed analysis has not been completed yet. Our system will be checking this website soon.


oscar-corpus.com Website Safety

Refresh
Last Check Date:
6/25/2023 11:06:29 AM
Fortiguard:
Business
Mcafee Category:
Education/Reference
OpenDNS:
BeFirst
Cloudflare DNS:
OK
MyWot Child Safety:
99

oscar-corpus.com HTTP Headers

Refresh
Accept-Ranges :
bytes
Access-Control-Allow-Origin :
*
Age :
0
Cache-Control :
max-age=600
Connection :
keep-alive
Content-Length :
54477
Content-Type :
text/html; charset=utf-8
Date :
Sun, 04 Jun 2023 05:58:37 GMT
ETag :
"63fd10c3-d4cd"
expires :
Sun, 04 Jun 2023 06:08:37 GMT
Last-Modified :
Mon, 27 Feb 2023 20:21:23 GMT
Server :
GitHub.com
Vary :
Accept-Encoding
Via :
1.1 varnish
X-Cache :
MISS
x-cache-hits :
0
X-Fastly-Request-ID :
7cc74991d0a3e06e757152ec7c5c62f779281249
X-GitHub-Request-Id :
4360:0B2D:9D89A1:FEDD75:647C280D
X-Proxy-Cache :
MISS
X-Served-By :
cache-lga21966-LGA
X-Timer :
S1685858318.856973,VS0,VE20


oscar-corpus.com W3C HTML Validation Check Now

Last Check Date:
5/30/2023 12:00:00 AM
Errors:
1
Warnings:
0
Info:
2

oscar-corpus.com Similar Sites

Website
Title
Rank
Entropic Data - Blogging data since 1886
21,441,944
Site not found · GitHub Pages
14,040,516
Innovature - Consulting | IT Services | Digital Transformation
5,145,847
Common Crawl - Web Data for Research and Development
601,100
SEO Data Strategy Consulting | SearchDatalogy
6,251,750


oscar-corpus.com Site H Tags

Check Now
h1
OSCAR
h1
Funding provided by
h1
Search
h1
Blog posts
h1
Publications
h1
Talks
h1
The OSCAR Team
h1
Contact
h2
Patrick Teufert
h2
Partners
h2
Common Crawl
h2
DWS at the University of Mannheim
h2
Ludwig-Maximilians-Universität München
h2
Core
h2
Pedro Ortiz Suarez
h2
Prairie Institute
h2
DFKI
h2
OSCAR
h2
ALMAnaCH
h2
Inria
h2
OpenGPT-X
h2
Julien Abadji
h2
Rua Ismail
h2
Laurent Romary
h2
Benoît Sagot
h2
Collaborators
h2
Sebastian Nagel
h2
Ayyoob Imani
h2
Contributors
h2
Sotaro Takeshita
h3
Ph.D. Student
h3
Ph.D. student at LMU Munich
h3
Crawl Engineer & Data Scientist
h3
Senior Researcher
h3
Senior Researcher
h3
Research Engineer
h3
Research Engineer
h3
Funding Project
h3
Funding Organization
h3
Funding Lab
h3
Open Source Project on Multilingual Resources for Machine Learning
h3
Funding Organization
h3
Funding Institute
h3
Researcher
h3
Partner University
h3
Partner Group
h3
Partner Organization
h3
Data Scientist
h5
Cite


What is SitesDB?

SitesDB is one of the largest databases of websites and domain names on the internet, with over 40 million entries and growing. For more than 12 years, we've been manually verifying and updating website and domain details, combining human expertise with AI-powered systems to ensure the accuracy and relevance of our data.

At SitesDB, we provide in-depth technical and useful information about websites and domains, including:

  • Website meta tags
  • Domain & WHOIS data
  • General backlink and ranking statistics
  • Social media engagement stats
  • Root page speed insights
  • Website content and HTML resources
  • Website safety and security details, sourced from multiple trusted security providers
  • HTTP headers analysis
  • HTML validation reports
  • Lists of similar websites and competitors
  • Website keyword analysis, including top traffic-driving keywords
  • Heading structure (H tags) breakdown
  • Domain variations across different TLDs (Top-Level Domains)

In addition to this data, SitesDB offers a suite of website analysis tools — including Chrome CRUX, Google Lighthouse, and our own AI-enhanced algorithms — to help identify alternative websites, direct competitors, and similar sites, all continuously refined through both automated systems and human review.