Skip to content

Commit

Permalink
linux setup
Browse files Browse the repository at this point in the history
  • Loading branch information
zick2 committed Jun 19, 2020
1 parent 9e2046c commit 23bee62
Show file tree
Hide file tree
Showing 7 changed files with 10 additions and 5 deletions.
2 changes: 1 addition & 1 deletion .idea/compiler.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

5 changes: 5 additions & 0 deletions .idea/jarRepositories.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

5 changes: 3 additions & 2 deletions .idea/libraries/jsoup_1_13_1.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

3 changes: 1 addition & 2 deletions src/main/java/Main.java
Original file line number Diff line number Diff line change
@@ -1,7 +1,6 @@
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.nodes.Node;
import org.jsoup.select.Elements;

import java.io.IOException;
Expand All @@ -21,7 +20,7 @@ public static void main(String[] args){
try{
//Fetching and parsing HTMl file ...
Document doc = Jsoup.connect(start_url).get();
//Extracting all <a> tags
//Extracting all <a href=""> tags
Elements links = doc.select("a[href]");
//For each tag extract the href attribute (it contains the urls we need)
for(Element url : links){
Expand Down
Binary file modified target/classes/Main.class
Binary file not shown.
Binary file modified target/classes/Spider.class
Binary file not shown.
Binary file modified target/classes/scrape.class
Binary file not shown.

0 comments on commit 23bee62

Please sign in to comment.