While I've focused on the of these lists, were you instead looking for a technical explanation of how security researchers analyze these files, or perhaps a different type of educational dataset for data science?

Use Have I Been Pwned to see if your EDU email is in a known combo list.

They provide free access to research journals (JSTOR, IEEE).