You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
SELECT
tech,
ARRAY_LENGTH(tech.info) AS count,
COUNT(DISTINCT version) AS count_distinct
FROMcrawl.pages,
UNNEST(technologies) AS tech,
UNNEST(tech.info) AS version
WHEREdate='2024-11-01'AND
client ='mobile'AND
rank =1000ANDtech.technologyLIKE'WordPress%'GROUP BY1,2
Seems like it's the case whenever there are >1 category for a technology:
SELECT
tech,
ARRAY_LENGTH(tech.info) AS count,
COUNT(DISTINCT version) AS count_distinct
FROMcrawl.pages,
UNNEST(technologies) AS tech,
UNNEST(tech.info) AS version
WHEREdate='2024-11-01'AND
client ='mobile'AND
rank =1000AND
ARRAY_LENGTH(tech.categories) >2GROUP BY1,2
Having distinct values instead would make iterating over the versions more efficient.
The text was updated successfully, but these errors were encountered:
Working on HTTPArchive/dataform#48 and noticed that we have many duplicated version values:
Seems like it's the case whenever there are >1 category for a technology:
Having distinct values instead would make iterating over the versions more efficient.
The text was updated successfully, but these errors were encountered: