Complete Lab 3-1 Data Reduction (pages 116-120 in your text). Answer the following questions:

 

Q1. What data do you think might exist to show that a vendor is related to an employee? Which attributes would you focus on?

 

Q2. How might you attempt to detect these connections between vendors and employees?

 

Q3. If you were the employee committing fraud, what would you try to do with the data to evade detection?

 

Q4. How many vendors have similar addresses to employees?

 

Q5. What do you notice about the vendor and employee street addresses?

 

Q6. Are there any false positives (fuzzy matches that aren’t really matches)?
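
Note on fuzzy matching: the lab in your text walks through its own tool and steps, which are not reproduced here. As a rough illustration of the idea behind Q4 through Q6, the sketch below scores address similarity with Python's standard-library difflib. The records, the helper names (normalize, similarity, fuzzy_matches), and the 0.8 threshold are all assumptions made up for illustration; they are not taken from the lab data.

```python
from difflib import SequenceMatcher


def normalize(address: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    cleaned = "".join(
        ch if ch.isalnum() or ch.isspace() else " " for ch in address.lower()
    )
    return " ".join(cleaned.split())


def similarity(a: str, b: str) -> float:
    """Return a 0.0-1.0 similarity ratio between two normalized addresses."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def fuzzy_matches(vendors, employees, threshold=0.8):
    """Compare every vendor address to every employee address.

    The threshold is an arbitrary illustrative cutoff, not a value from the lab.
    """
    matches = []
    for v_name, v_addr in vendors:
        for e_name, e_addr in employees:
            score = similarity(v_addr, e_addr)
            if score >= threshold:
                matches.append((v_name, e_name, round(score, 2)))
    return matches


# Hypothetical records, invented for this sketch only.
vendors = [("Acme Supply", "123 Main St."), ("Blue Ridge Co", "77 Harbor Road")]
employees = [("J. Doe", "123 Main Street"), ("R. Roe", "9 Elm Avenue")]

result = fuzzy_matches(vendors, employees)
print(result)
```

Lowering the threshold surfaces more candidate pairs but also more false positives, which is the trade-off Q6 asks you to think about.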